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INTRODUCTION 



This volume brings together 11 papers written over the past 
few years by faculty and students and others connected in some way 
and at mm time with OSU. Some of the papers, i» particular those 
by Miller, Johnson, Rupee, Stollenwerk, Valimaa-Blum, and Lee, were 
originally written for graduate-level lecture courses and seminars, 
while the others represent independently motivated work. 

The common thread holding these papers together is that they 
all touch on issues relevant to historical linguistics and to the 
descrption and explanation of language chenge. They effectively 
reflect current work being done in historical linguistics in general 
and moreover are representative of those aspects of historical 
linguistics that are considered especially important in the OSU 
Linguistics Department. Thus, there is a definite sociolinguistic 
thrust to this collection, with several papers—those by Clark & 
Joseph, Rupee, Stollenwerk, Valimaa-Blum, and Lee— focusing on 
different aspects of dialect borrowing in language change (though 
each with its own special perspective, e.g. Clark & Joseph's on 
using a particular dialect borrowing explanation in etymologizing, 
Lee s on the relevant evidence for the interpretation of the extent 
and direction of change from earlier stages of Indie, etc.) and on 
the relevance of social factors in the spread of linguistic 
innovations. 

At the same time, though, the sore formal side of language 
chenge, along with other—especially internal— types of motivation 
for change is attended to in other papers, most notably those by 
Nevis, Miller, and Joseph, and in the Janda & Joseph collaborative 
effort, a general theory of morphological change is advanced which 
is tested anil elaborated upon in Johnson's paper. 

It shoulc be noted as well that change in virtually all 
levels of language is covered, including phonetic change 
(Stollenwerk), phonological change (Janda & Joseph, Johnson), 
morphological change (Nevis, and, again, Janda fc Joaeph, Johnson), 
morphosyntacic change (Miller), syntactic chauge (Joseph, and, 
again, Nevis), lexical change (Rupee, Valimaa-Blum, Clark & Joseph), 
and to a limited extent, semantic change as well (in Rupee's paper). 

Finally, the variety of languagea covered ia noteworthy: 
Sanskrit, Middle Indie, Greek, Arabic, Old English, Modern English, 
Finnish, Saame, and Estonian each foim the major focal points of one 
or more of the papers, reflecting our belief that to understand the 
general phenomenon, language change, one must investigate a wide 
range of natural languages. 

I would like to thank Sung-Ae Rim, Marlene Payha, and Hideo 
Tomita for their assistance in the production of this volume. 
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Decliticization and Deaffixation in Saame: 
Abessive taga * 

Joel A. Nevis 



1 . Introduction 

Agglutination is an extremely common diachronic process in the 
languages of the world. As one of the oldest and best-known theories 
of the genesis of affixes, it has been used widely as a method of 
reconstructing constituent order, as in Givon's (1971) now well-known 
slogan "Today's morphology is yesterday's syntax." According to the 
agglutination hypothesis, affixes are historically former words that 
have lost their independence and have cliticized onto a neighbor, only 
later to fuse into the host as an affix, as in (1). 

(1) WORD > CLITIC > AFFIX 

There have been recent attempts to constrain and even refute this 
methodology," e.g. by Anderson (1980) and Comrie (1980) among others. 
In general, though, linguists have accepted the agglutinative cycle of 
words, even if only as a general tendency. 

There are only a few instances of the opposite direction of 
change in the literature on agglutination (e.g. Jeffers and Zwicky 
1980, Jandn 1981 : ... h an affix has become a clitic or a clitic 
has become a word: 

(2) AFFIX > CLITIC > WORD 

I adopt some relevant terminology from Jeffers and Zwicky (1980). The 
reanalysis of a word as a clitic is called cliticization and the 
reanalysis of a clitic as a word dccliticizaticn . The reanalysis of a 
clitic as an affix is affixation and that of an affix as a clitic is 
doaf fixation : 

(3) Cliticization: WORD > CLITIC 
Decliticization: CLITIC > WORD 
Affixation: CLITIC > AFFIX 
Deaffixation: AFFIX > CLITIC 

In Saame (I.Hppish) deaffixation and decliticization are possible 
developments. The abessive morpheme is traditionally viewed as a case 
ending, but T will argue that it is in fact a clitic in most varieties 
of N. Saame and Kildin Saame, and a full word in the Enontekio variety 
of N. Saame (section 2). Afterwards I will demonstrate that the 
abessive originates historically as an affix (section 3 and 4). 



2 • -Kyj-d etice for Synchron i c Loose Status 

The motivation for the clitic postposition status of abessive 
l n&. ( liiiga) comes from Us syntactic, properties. It exhibits the 
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syntax and morphology of regular postpositions in that it governs the 
genitive on the preceding noun phrase, it permits "conjunction 
reduction", it attaches outside possessive enclitics, and it disallows 
adjective-noun concord . A typical case suffix, in contrast, attaches 
to a stem rather than a fully formed genitive noun phrase, does not 
permit "conjunction induction", attaches inside possessive enclitics, 
and allows adjective concord wherever appropriate. 

The paradigm in (-1) shows the oorphotnet ics of n noun. Notice 
that case morphemes precede possessive morphemes. This is entirely 
expected since the case endings are generally 1 true suffixes and the 
possessives nre enclitics (Collinder 1949: 7, 1957: 193-94) - clitics 
always attach externally to affixes when the two cooccur (cf. Zwicky 
1977). 

(4) POSSESSIVE PARADIGM for ak'ko 'grandmother' 

plus possessive 'my' (from Itkonen 1960:46-49) 





SINGULAR 




NOMINATIVE 


ak'ku-m 




GENITIVE 


ak'ku-m 




ACCUSATIVE 


ak'ku-m 




ILLATIVE 


ak'ku -s§- 


m 


LOCATIVE 


akko-stS- 


•m 


ESSIVE 


ak'ko nS- 


m 


COMITATIVE 


akko"in§- 


•m 




PLURAI. 




NOMINATIVE 


akko-idS- 


•m 


GENITIVE 


akko -idfi 


m 


ACCUSATIVE 


akko-idS- 


•m 


ILLATIVE 


nkko-idS- 


sa *m 


LOCATIVE 


akko-i-nS-m 


ESSTVE 


nkko-narn 


COMITATIVE 


akko-idS-m-guim 



(5) MORPH0TACTICS of the Locative Plural 

akko-(idS) -sa -m "to my grandmother' 
STEM-<(NUMBER)-CASE--POSS 

In those dialects and languages that permit possessive plus abessive 
at all, the abessive noun phrase has the opposite ordering, with 
possessives preceding the abessive morpheme: 

(6) M0RPH0TACTICS of the Abessive (Ter dialed of Kildin Saarae 

as reported by Szabo 1984: 1G8) 

alaga-n-ln 'without my son' 
son-lSG-ABES 

Either (6) displays an endoclitic possessive ju oi ,\s I artfue hei e 
— the abessive is not a true suffix. 
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Although adjective-noun concord is very limited in Saame, it does 
appear in certain demonstrative and interrogative pronouns, in 
cardinal numbers and in the adjectives buorre 'good* and bSh§ *bad\ 
True case suffixes exhibit agreement, e.g. in (7), but postpositions 
do not. The abessive patterns with the postpositions in this regard, 
as in (8) below. 6 

(7) AGREEMENT — Locative Plural (Korhonen 1981:315) 



No case suffix permits "conjunction reduction" (to use 
transformational terminology), yet the abessive allows it, even 
prefers it. Compare the conjoined noun phrases in (9) and (10), where 
(a) represents the full versions and (b) the reduced versions. 
Conjunction reduction is also preferred for postpositional phrases, as 
in (11), where relevant postpositional phrases are bracketed for ease 
of exposition. Again the syntactic behavior of the abessive parallels 
that of the postpositions. 

(9) SUFFIX — Comitative Singular (Sammallahti 1983:56) 



father(GEN) and Issat-COM 
'with father and Tssat* 

(10) ABESSIVE (Bartens 1984) 

a. airoj-taga ja borjas-taga 
oar-PL-ABES and sail-ABES 

'without oars and without a sail 

b. airoj ja borjas-taga 
oar-PL and sail-ABES 
'without oa-s and a sail* 

(11) POSTPOSITIONS (Bartens 1978: 17,77) 

a. ja ruohta (alemus varidi nala] ja (kaisait nala] , 
and runs highest mountains up and summits up 

kos lae kalosaebbo. 
where is cooler 

'and runs up the highest mountains and up the 
summits where it is cooler.* 



burii-n mani-i-n 'to the good children' 



(8) 



NO AGREEMENT in Abessive (Sammallahti 1983: 174) 
ciezaid gapmagi i-haga 'without seven shoes' 
♦ciezai-haga gapmagi i-haga 



a. .Ihciin ja Issahiin 
father-COM and Issat-COM 
'with father and with Issat' 



b. *Ahci- ja Issahiin 





b. ja ruhtet (cuoikait ja bahka sivas] ala vara 

and run mosquitos and heat reason high mountain 

luokait . 
along 

'and (they) run because of the mosquitos and heat 
along the high mountain 1 

Finally, in all of the above examples the abessive governs the 
genitive on the preceding noun phrase. This is typical of regular 
postpositions in the language. By comparison, true case suffixes 
either have concord between appropriate modifier and head noun, or 
else require a (default) attributive form for the modifier. 

In summary, the abessive behaves synchronically like a 
postposition and thus should not be considered a case suffix. The 
only property distinguishing -taga and other postpositions is its 
attachment to the preceding noun. The nature of this attachment is 
not clear to me ct this point — there is no word-internal 
phonological evidence to prove that the abessive attaches 
phonologically. Therefore it does not deserve to be called a clitic. 
Instead, it appears that -taga is simply a stressless postposition, 
which cannot accept stress under any circumstances (Sammallahti 
1983). I conclude that abessive - taga is synchronically a semi-clitic 
postpositi on.- 

In the Enontekio dialect of Northern Saame, the abessive has 
complete phonological independence and is not even a semi-clitic (much 
less clitic) postposition: 

(12) Western Enontekio: mun bahcen haga 'I remain without 1 
Eastern Enontekio: mun b&hcen taga *I remain without 1 
I go without 

In these parallel examples, haga and taga appear as adverbs and do not 
require a host for cli ticization or prosodic learning. 

3. Comparative Evidence for Former Affix Status 

To return to the historical side of the topic, it could be 
argued that Saame retains what was in Proto-Finno-Ugric a full word, 
which in all other Finno-Ugric languages underwent cliticization with 
subsequent affixation. This is certainly a plausible parallel 
development, considering that most Finno-Ugric languages are 
agglutinative by nature. But comparative evidence uncovers certain 
idiosyncrasies in nearly all the sister languages, demonstrating that 
the abessive morpheme should be reconstructed as an affix in the 
purent language. 

Most of the Finno-Permie languages have cognate abessive case 
endings H3); the Ugric branch exhibits a somewhat different cognate 
set. The proto- form from which these Finno-Permic abessive are 
derived is caritivti ♦ pta plus lative t^k. 



(13) ABESSIVE CASE ENDINGS (Bartens 1984) 

Southern Saame -pta, -t'ta 

Northern Saame -t'ta, -taga 

Finnish ■■tto , f -US' 

Mordva # 

Mari -tS, -de* 

Udmurt -tek 

Komi -teg 

Northern Saame taga displays a pleonastic lative *;*/-n (Korhonen 
1981, Bartens 1984): — 

(14) taga < *pta+k+(e)k/n (Caritive *pta + Lative *-k + 

Pleonastic Lative *-k/-n) 

This caritive element appears also in an adjectival suffix — in 
the Finno-Permic languages it is formed with a -ma suffix (15a); in 
the Ugric languages it has an $^1 suffix (15b), 3 

(15) ADJECTIVAL CARITIVES (Bartens 1984) 

a. N. Saame -tabme 
Finnish -ton, -ttoma- 

Mordva -vtomo, -vteme; -ftama, -ftama, -ft^m? 

Mari -Um, -'oam 

Udmurt -tem 

Komi -tern 

b. Hanti (-tarn, -tem, borrowed from Komi) 
Mansi -tal, -tal 

Hungarian -talan, -tlan, -telen, -tlen 

It is thus unlikely that * zE ta was an independent word which 
cliticized in all of the sister languages except for a few varieties 
of Saame. And additional evidence shows that cliticization would have 
had to take place at an early stage in the development of the 
Finno-Permic languages. Most of these languages exhibit relic verbal 
abessives in which the nominal abessive (* -pta+k ) is attached to the 
verbal base, as in (16) below. Since case affixes do not generally 
attach to verb stems, or else require an intervening nominalizing 
suffix, the relic forms are unpredictable and, again, not a likely 
parallel development. Therefore the bound nature of the * -pta 
morpheme is due to genetic inheritance from the Finno-Ugric parent 
language. 

(IB) VP!*" 'HESSTVE (Bartens 1984) 

N. Saame (taebme) 

Finnish -tta', -tta 1 

Mordva £ 

Mari -de 

Udmurt *tek 

Komi -tgg 

Hanti -lay, -J?i*k, -Jjajf 

ERIC 12 
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The comparative method dictates that we reconstruct a 
(derivational) suffix * -pta which in conjunct with lative *iH formed 
an abessive case ending and which in combination with derivational 
suffix ♦ -ma (or *_1 foi in) formed a caritive adjectival suffix. 
Thus I posit the following development: 

(17) LATE mOTO -FlNNO-rGMC 



*-pta 




*-ptu-k *pta-ma 
[N, VJ *?ta-l 

[aim] 

The methodology requires that the source for Northern Saarae be a 
suffix and that deaff ixnt ion and deel iticization he innovations in 
Northern and Kildin Saame. 

4. Language-Intc-nnl Evidence for Former Affixal Status 

Language-internal evidence also indicates that the independence 
of taga is an innovation. In most dialects of Northern Saame there is 
another abessive allomorph, namely -t' ta (Col Under 1957: 190, 
Sammallahti 19R3: 167-Gfl) , which does not have the word- like 
characteristics of taga. The -t'ta allomorph occurs with trisyllabic 
stems, as in (IB). 

(18) gahmfisfi-t* ta •without a (reindeer) skin' 

(Collinder 1957: 190) 

The taga allomorph occurs with stems having an even number of 
syllables: 

(19) dolfi-tfigfi 'without fire' (Collinder 1957: 190) 

Although the two are in complementary distribution, the taga allomorph 
is now spreading at the expense of the -tMa allomorph. This is 
allowed because of stem allomorphy in the paradigm, whereby stem 
allomorphs can alternate according to number of syllables, as in (20). 

(20) gabmasfi- 1 ' ta — gabmas-ta*g5 "without a (reindeer skin)' 

(Collinder 1957:19) 

5. Summary 

The scheme that. T offer here requires that a former affix 
sequence ota-k(-k/-n) . which formed that abessive, have acquired a 
certain amount of phonological independence in several varieties of 
Saame and later (in Enontekio) complete independence; these 
developments are illustrated in (21): 

(21) -pta-k-ek/n > -ptakek/-p taken >*> -taga > taga/haga 

AFFIXES AfFI.X CLITIC WORD 
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Northern Saane has a semi-clitic abessive -taga which used to be a 
true affix. In the Enontekio subdialect it has come to stand as an 
independent word. I have explained that these two taga morphemes have 
cone about through, first, deaffixation and, then, decliticization. 
The Saane data discussed here constitute a good example of the 
opposite of agglutination: bound forms can acquire phonological 
independence to become independent words. I conclude that "today's 
syntax can be yesterday's morphology. 



* This paper was composed at the Ohio State University, though I 
prepared the final version wMle on the faculty of the University of 
Michigan. The paper was presented to the 1985 ISA Winter Meeting in 
Seattle. Transportation to that meeting was prov 4 2ou in part by 
funding from the OSU Linguistics Department. *ne ideas expressed here 
in are the result of research carried out in 1983-84 while on 
fellowship at the University of Helsinki (Supported by the 
American-Scandinavian Foundation and by the United States Educational 
Foundation in Finland). I benefited greatly from the input provided 
by University of Helsinki Professors Raija Bart ens and Mikko Korhonen, 
and I hereby express my gratitude to them. 

1. The morphotactics of comitative plural guim reveal that it, 
too, is a clitic postposition rather than a true suffix. See Nevis 
(1986ms). 

2. A semi-clitic is a prosodic leaner, and does not interact 
phonological ly with the host. A veritable clitic has clear 
phonological interaction with the host. See Nevis ( 1985a, b) for a 
discussion of the relevant terminology with respect to Finnish and 
Estonian. 

3. In the discussion at the LSA meeting, Robert Austerlitz 
suggested that the -ma suffix has the same etymology as the deverbal 
suffix una of Finnish. This would mean that the * pta + k combination 
(i.e. derivational plus inflectional suffix) merged into a case 
ending, while *p_ta + ma (two derivational suffixes) fused into a 
single adjectival suffix. If * pta was a derivational suffix, "hen the 
relic verbal abessive forms are no longer a mystery. Although 
inflectional affixes are generally selective in stem selection, i.e. 
limited to a single word class, derivational affixes can sometimes be 
promiscuous in stem selection, relying on semantic rather than 
syntactic information 

4. This is Janda's paraphrase of Givon's paraphrase cited above 
in the first paragraph. However, neither Janda's data nor my own 
prove that deaffixation and decliticization have any effect on 
constituent order. 



Notes 
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Decliticization in Old Estonian 3 



Joel A. Nevis 



1. Introductory Remarks 

Agglutination is a universal diachronic process among the languages of 
the world. As one of the oldest and best-known theories of the genesis of 
affixes, it has been used widely as e method of reconstructing word order — 
as in Givon's (1971) well-known slogan "Today's morphology is yesterday's 
syntax." Givon's methodology has been constrained by some (e.g. Anderson 
1980, Comrie 1980, and others) and has been refuted by yet others. In 
general, though, linguists have accepted the agglutinative cycle of words, 
whereby affixes are historically former words which have lost their 
independence and cliticized onto a neighbor, inevitably later fusing into the 
host as an affix. 

According to Givon's principle, an affix continues the positioning within 
the word that its word source had within the phrase. Comrie points out by way 
of criticism that some clitics exhibit a special attraction to the position 
after the first constituent of a clause — a positioning not shared by full 
words. However, I have argued that these clitics are phonological ly dependent 
syntactic words (Nevis 1985a); in addition, examples of full words occupying 
this "second position" slot are not uncommon (Nevis 1985a, Wackernagel 1982). 
The clitics in question are generally sentential in scope (Kaisse 1985), and 
are called 'second position' or 'Wackernagel- type' clitics. Second position 
clitics have a peculiar resistance to completing the agglutination cycle, so 
that Comrie 's remarks are not to be rejected out of hand after all. 

In Baltic Finnic one finds several Wackernagel-type clitics that have 
been diachronical ly stable. Interrogative *ko, informal *s, and emphatic *oa 
exist in most BF languages today as clitics, and continue that status from the 
parent language, Late proto-BF. In Estonian these morphemes have been lost as 
clitics, but instead of becoming affixes, they have either decliticized into 
seperate words or disappeared altogether. 

1.1. On the Agglutination Hypothesis 

I adopt here the terminology of Jeffers and Zwicky (1980). By 
cliticization I mean the reanalysis of a word as a clitic. Decliticization is 
the reverse metanalytic reinterpretation of a clitic as a word. Affixation is 
intended here to be a diachronic process: the reanalysis of a clitic as an 
affix. Conversely, deaffixation is the change from a former affix *"o a clitic. 



(1) a. WORD 



> 

cliticization 



CLITIC 



> 

affixation 



AFFIX 



(2) a. WORD 



< < 
decliticization 



CLITIC 




AFFIX 
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Samples of the agglutination cycle (i.e. (1) above) are numerous. 
Examples of loosening (i.e. (2) above) are rare. One 8U ch example can be 
found in Jauda (1981), who examine* the history of the English genitive Barker 
s, arguing that it is a clitic in Modern English with a source in Old English 
as an affix. 

Janda argues that deaf fixation in this instance case about in Middle 
English through the homophony of genitive ^es and unstressed pronoun (h)ys 
'hiss e.g. my moder vs sake (see Janda 1981 for sore examples and details of 
the analysis). Janda also suggests that, had it not been for this homophony 
and deaffixation, Middle English would have lost genitive ^ along with all 
the other case endings (as has happened in the Northern British dialects — * 
Janda 1981: fn. 4). Janda's analysis is not uncontroversial; 1 Carstairs (1985), 
for example, has an alternative account of deaffixation in Middle English. 

Jeffers and Zwicky (1980) likewise offer an analysis from reconstructed 
particles in Pro to- Indo-European that putatively falsifies the "tacit 
assumption that that clisis is invariably one stage in an inexorable 
development toward the status of an affix or toward ultimate oblivion". 
Actually there is no evidence to prove the clitic status of the particles in 
their example — Second Position cannot be used as an indication of clitichood 
here (Nevis 1985a). Even Wackernagel (1892) acknowledged the availability of 
this slot for unstressed words in general (what he called quasi-enclitica). 

Nevis (1985b) offers an instance of both deaffixaton and decliticization 
of an inflectional morpheme in Northern Lappish. 

There are other examples of deaffixation and decliticization in the 
literature, but these usually involve changes in the lexicon (i.e. they are 
derivational morphemes). Several these examples are discussed by Vesikansa 
(1977) for Finnish. A common example in many parts of Europe is the 
decliticization of -ism (see Ariste 1968-69). In English, Finnish, and 
Estonian, one can speak of all kinds of "isms", with ism itself having the 
meaning 'doctrine, theory*. 

As a reaction to the Agglutination theory of the origin of affixes, Tauli 
(1953) tells us, Alfred Ludwig 2 postulated the Adaption theory, and later 
Jespersen (1922) the Secretion theory. Both theories entail metanalytical 
reanalysis. Underlying Ludwig' s Adaptation theory is a reanalysis of 
derivational or emphatic elements as inflectional. Jecpersen's Secretion 
theory involves a reanalysis of "one portion of an originally indivisible word 
as coming to acquire a grammatical significance" (1922: 77). The possibility 
of metanalytical reinterpretation in morpho-syntactic change also permits the 
change from bound morpheme (i.e. affix or clitic) to full word. 

Old Estonian offers a further example of decliticization. In Old 
Estonian two Second Position Clitics, namely interrogative es and emphatic eg, 
are frae words showing no phonological interaction with a preceding word. 
Following Ariste (1973) and Alvre.(1976, 1981), I argue that these two words 
have their source in Proto-Baltic Finnic Second Position clitics #s and *pa. 

1.2. On Clitics 



A clitic is a morpheme (possibly morphologically complex) having a mixed 
word/affix status. This is to say that it has some properties of words and 
some properties of affixes. Both diachronically and synchronically the clitic 
aDDears to be intermediate between the word and the affix. 
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Continuing along the lines of previous work (Nevis 1985a), I adopt the 
position that clitics are not primitive units of languages. Rather they fali 
into two classes of derived phenomena: (phonological ly) bound words and 
phrasal affixes. The former is a special kind of word, the latter n typo of 
affix. The bound word, in particular, has the syntax of words, but the 
phonological properties of affixes. 

The Finnish clitics -pa/-pa", -ko/ko, and -han/-han are bound words, as 
argued by Nevis (198fia). Although positioned with respect to the sentence as 
a whole, these morphemes cannot stand alone as independent words. Their 
phonological subordination to a preceding word is demonstrated hy the 
application of vowel harmony. Vowel harmony operates over the phonological 
word, never beyond. Thus it can he used as a test for clitichood in Finnish: 



tuuli-han 'the wind, you know* * tuuli-han 
tyyli-han *style, you know 1 — #tyyli-hHn 



The Baltic Finnic clitic, as represented by Finnish, are Wackernagel-type, 
or Second Position (2p), clitics. These clitics are bound words that, in an 
otherwise free constituent order language, occur strictly positioned after the 
first constituent of the clause and enclitic on it. See section 2.1. for 
examples. 

Second Position clitics (or actually Second Position bound words) have a 
particular resistence to affixation — both to phrasal affixation and to 
regular affixation. I demonstrate this point with data from Old Estonian. 
The Proto-Baltic Finnic Second Position clitics inherited by Estonian show 
some instability, namely deel iticization. On the basis of the Baltic Finnic 
data, I offer a general account of possible and probable developments for 
Wackernagel-type clitics. 

2. Ep and Es in Estonian 

There are no Second Position clitics in Modern Estonian. From the point 
of view of her sister languages, a gap exists in Estonian. To account for 
this gap, one assumes that deel iticization has taken place only in Estonian. 
This analysis is supported by two kinds of evidence. There is first an 
argument hased on complementarity — where the other Baltic Finnic languages 
have 2p clitics, Old Estonian has full words. The second argument relies on 
shared relic forms in all the Baltic Finnic Languages. Some support comes 
from a third source — relic forms in Estonian alone. 



2. 1 . Wackcrnagel's Law 

Cognate morphemes in sister languages are Second Position, or 
Wackernagel-type, clitics. That is, they are phonological ly bound words which 
are attached to the first constituent of a clause (no matter what that 
constituent may be). Corresponding to Estonian emphatic eg are emphatic 
clitics in the various sister languages: 



(3) Finnish -pa/-pa Tule-pa kotiin 

come-EMP home 
Karelian -bo mida-bo 

what-EMP (Ahtia 1036:0) 
Lyydi -bo mida-bo 

what-EMP (Larjavaara 1979:109) 
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Similarly, the BF cognates to Estonian interrogative §s include Finnish, 
Karelian, Lyydi, Vepsa ^s, which indicates not interrogativity, but 
informality (i.e. ^ is a register marker). I account for the semantic 
discrepancy below in section 3.4. 



(4) Finnish -s: tule-s tanne 
come IF here 



Karelian -s: ottakkoa-s 'take (2PL IMP)' 

take(2PL)-IF (Ahtia 1936: 132) 

These are a subset of the various BF sentential clitics that obey 
Wackernagel's law. Other such clitics include BF H<o, northern BF -hanAhan , 
Lyydi -Vepsa s|, and a few more clitics (Penttila 1957, Ahtia 1936). 

(5) Finnish -han/-han anna-han menna 'let her/him/it go* 

give -HAN go 

Votic -ko evat-ko sii 'don't they eat?' 

not-Q eat (Ahlqvist 1858:5) 

Lyydi se kod'ii se ruoh nu tulda en 

home EMP dar«. come not 
*Home I didn't dare (to) come' 
(Larjavaara 1979: 116-17) 

On the basis of comparative evidence, we want to reconstruct for 
Proto-Baltic Finnic at least three Second Position clitics: emphatic *pa, 
informal #s, and interrogative *ko. The other clitics are more recent 
innovations. There are few traces of these clitics in Modern Estonian. For 
this reason one looks to ej> and es, which correspond roughly in meaning and 
positioning, as continuers of the clitics. 



Beyond the evidence presented in the following section, it is not 
entirely clear that *£a and *ko were true clitics and not just quasi-enclitics 
(i.e. stressless words) in the parent language. It turns out that their 
clitichood has no bearing on the decliticization analysis I present below, 
since clitichood can he established for Old Estonian on the basis of internal 
reconstruction. 



2. Estonian ep 



Ariste (1973) contends that Estonian affirmative emphasis marker ep is a 
direct continuation of Proto-Baltic Finnic clitic *pa. It appears in roughly 
the same sentential slot as -pa in other BF languages, and it has the same 
meaning. Ariste cites a number of examples from turn-of-the-century Estonian 
and from the modern dialects. I repeat select ones below; for a more complete 
listing I refer the reader to Ariste (1973). 

Emphatic ej> is generally located after the first constituent of the 
sentence, as in the following examples: 

(6) selle kivi peat ep koltfitigi neid riideid 
this lock on EMP pounded these clothes 
*on this stone one pounded the clothes' 
(Ariste in7.1:.1.1; originally from Saareste 1958) 
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(7) Mis ep saame nana 
what EMP get see 
'What do we get to see' 

(8) Need ep vist sinu omad ongi? 
these EMP probably your own is-EMP 

'These are probably your own?' 

(9) Siis ep ilmub Isakene, Tuleb taevast taadikene. 
then EMP appears daddy comes sky-EL granddaddy 

'Then Daddy appears, granddaddy comes from the sky.' 

(10) See ep siis tuli, et temal venl tihli sobradele oli anda 
there EMP then came that him still often friends-AL was give 

'There then it came that he still often had a gift for friends* 

(11) See ep see on, mis suurem rahvahulk, 
chat EMP that is what larger crowd 

et vanal viisil mo*tleb, ara moista ei voi 
that old -AD way-AD thinks understand not can 

'That is that, what a larger crowd that thinks in the 
old way cannot understand' 

Most examples involve a one-word constituent at the beginning of the clause, 
but examples like (6) above show that multiword constituents can occur before 
ep. as well. The location of ej> in second position is not strict, however, as 
demonstrated by (12). 

(12) Tana naeb ta kirikus Hildat ja temale ta ep HI led 
today sees he church-IN H. and her-AL he EMP flowers 

viibki 
brings-EMP 

'Today he sees Hilda in church and to her he brings 
flowers' 

Thus, e£ appears in the "third" slot in this example. Ariste further suggests 
that it can occur in sentence-initial position: see ep > ep see . 

Not only has ej> acquired phonological independence in its development 
from Proto-Baltic Finnic, but it seems to have acquired a certain amount of 
syntatic freedom as well. Numerous relic forms remain in the modern language, 
so that we know that ej> was a bound morpheme. These are discussed below in 
section 4.2. 

2.1. Negative ep 

Ariste points out that Wiedemann (1857) considered (e)p a clitic in his 
dictionary, but confused it with the negative verb ej>. The old third person 
singular of the negative verb was ep; it has been replaced in modern Estonian 
by ei, which has now spread to all persons and numbers. Nonetheless one still 
finds in the modern language ep ole (= ei ole ) 'is not' and ep olnud (- ei 
olnud) 'was not' alongside their proclitic forms pole and oolnud. 
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The continuation of Proto-Baltic Finnic #pa is affirmative emphatic ep, 
not negative verb eg. 

3. Estonian es 

Modern Estonian exihibits several es words, only one of which is a direct 
continuation of Proto-Baltic Finnic *s. The various homophonous morphemes are 
discussed below. 

3.1. Interrogative es 

Old Estonian interrogative es has been discussed by Ojansuu (1922) and 
Alvre ()976, 1981). It is a Second Position word, as seen in the 1686 New 
Testment examples that follow (from Ojansuu 1922: 93-94). 

(13) Niiiid es tee uSSute (Joh. 16: 31) 
now Q you believe 

'Now do you believe?' 

(14) Kelt es Se Proweet Sedda u'tlep (ApT. 8: 34) 
whom-ABI. 0 the prophet that says 

'About whom does the prophet say that?' 

(15) Kumb es 5e Suurem KaSk om SaduSSen (Matt. 22: 36) 
Which 0 the larger order is law-ILl 

'Which is the greater command under/according to the 
law?* 

(16) MiSt es roinna Sedda pea tundma (Luk. 1: 18) 
What-EL Q I that must know 

'How should I know thai?' 

(17) Mink ka es Sis Soolata (Matt. 5: 13) 
What also Q then salt 

'What kind then should you be?' 

(18) maratse es teije Sis peiite ollema (Joh. 6: 30) 
what-kind Q you then must be 

'What kind then should you be? 1 

(19) Marast Tahte es Sinna teet ...? (Joh. 6: 30) 
What-kind letter Q you make 

'What kind of letter/mark are you making ...? 

(20) -Sratsel Nimmel es teije Sedda ollete tennu? (ApT. 4: 7) 
what-kind name 0 you that are done 

'In whoso name have you done that?' 

(21) Kes om minno Emma, ninck kumma es omma minno Welle 
who is my mother and which Q is my brother 

'Who is my mother, and which is my brother? 1 (Matt. 12: 48) 

Examples (17 19) demonstrate that location in the sentence is determined by 
constituents, not words, since interrogative es appears not after the first 
word of the clause in these examples, but after the first constituent. One 
example shows that, like emphatic ej>, es can occur elsewhere in the sentence 
; n Second Position. 
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(22) Kellega teie es sin tahate konelda 
whom-COM you 0 here want speak 

'With whom here do you want to speak? 1 
(from Wiedemann's Grammar, cited by Alvre 197G: 340) 

Several of early examples still show es as a clitic. The word cannot maintain 
.wy phonological independence since there is no vowel present. All s examples 
are written togethe. with the preceding word (23-24), so that we have 
orthographic evidence that es wns once a Second Position Clitic. 

(23) — Kustas oeije Lanen ni pal jo Leiba Same (Matt. 15: 33) 

whence-Q our kind get so much bread get 
* — from where does our kind get so much bread?' 

(2-1) Sinnas utsinda woi.. oJlet... (I.uk. 24: 18) 
you-Q alone stranger are 
'Are you alone the/a stranger...? 

In both of these examples, the final s of the first word ( Kustas and Sinnas ) 
is the interrogative clitic. 

There is a semantic discrepancy in the correlation between Estonian 
interrogative es and BF informal ^s. I deal with this problem below in 
section 3. 4. 

3.2. Conjunction es 

Interrogative ejs is homophonous with* and according to some, 
etymologically identical to conjunction £s. The latter is found in a 
seventeenth century Southern Estonian verse, as well as in Old Finnish (that 
is, the southwest dialect used by Agricola, Finno, and Hemminki). In both 
Southern Estonian and Old Finnish, Ojansuu (1922) tells us, es had the meaning 
*if, although'. Ojansuu' s contention that 2P interrogative es originated in 
initially positioned conjunction es is a viable alternative account to 
Alvre's, so it must be examined in detail. 

In all of Oja*>suu's examples, conjunction es begins the clause, as 
conjunctions generally do in Finnish and Estonian. 

(25) South Estonian: 

Es mejie juhren olios Seddn i>altu ni palju 
if our place being that sin so much 

jummala juhren om wehl ennamh armu. 
god's place is still more favored 

'Tf in our place was so much of that sin, God's place is 
even more favored. ' 

(20) Agricola: 

Eipe heiden pirlhe woittamnn .... Es quinkn corkiaSti 

not -and they must win although how highly 

he lendeuel: - osquiga he ennen cucoiSt it 

they fly although-how they before flourished 
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'And they must not win . .., No matter how high they fly; 
■■- no matter how they flourished before.' 

(27) Finno: 

es cuSa Tnhimincn olis 
if where person would-be 
'if 11 person would be somewhere 1 

(28) Hemminki: 

Es cuca vihans phahan nacka 
if who anger head-ILL flings 
'If someone flings his anger into his head* 

Conjunction es is derived from a former demonstrative 
pronoun/relative/indefinite pronoun stem *e^. The e-stem also occurs jn the 
Finnish conjunctions etta 'that", ellei 'if not\ and dialectal ella 'if, and 
in Estonian emb-kumb 'either (one or the other)', cf. Hakulinen (1979: 74), 
Paasonen (1906) finds cognates of the Baltic Finnic e-system in Mordvin, 
Zyrian, Votyak, Ostyak, and Hungarian, and suggests that the e-stem is a 
variant of the Jo-stem (cf. Finnish jo-s 'if 1 just like conjunction e-s). 

The s in conjunction es is a lative ending, whit.:. * ■ :ilso found in the 
Finnish conjunctions jos 'if* and koska 'because, when* (the ^ka here is a 
former clitic see Nevis 1984), and in the adverbs alas 'down', ylos 'up' 
and £o ijs 1 away ' . 

Ojansuu proposes that conjunction es is the source for interrogative es. 
His proposal would require that there was a change in positioning in the 
sentence, a semantic change 'if, although' > 'whether', and a shift from 
dependent clause to main clause. It would leave open the question of why 
there is an absence of 2P clitics in Estonian and it would leave unexplained 
all the relic forms in Estonian (cf. section 4 below). Ojansuu 's account 
would have to posit not only the two syntatic changes and the semantic change, 
but also a phonological change (end is is— examples (23) and (24) above have 
clitic 2S)» 

Since some dialects have both interrogative and conjunction es words, 
Ojansuu would also have to assume a syntactic and semantic split. 

3.3. PastJTen se N egati ve es 

Interrogative es is also homophonous with another unrelated form in the 
language, namely negative past tense es. Some of the Estonian dialects have 
innovated a past tense for negative verb (stem in e^). In general in Daltic 
Finnic, the negative verb has a defective paradigm. It may be inflected 
for person and number, hut not for mood and tense. The Estonian dialects have 
allowed the negative paradigm to include tense, so that e-p is present tense 
and e^s is past tense, just like laula-b 'sings' and laula-s 'sang'. 

(29) es nae' tima miDaGi' (Savijarvi 1981: 111) 
not~past see s/he something 

'She/he did not see anything' 
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.1.4. Interrogative Negative es 

A Miird lmmoplionous es tomes from I lio negative verb <v plus clitic or 
from clitic combination ko s . As in Finnish, the negative verb ei combined 
with clilirs *ko rind *s to form a single lexical ixed unit: es ' * oi -ko~ s. 
(Finnish has eikos, oiks.) The difference between the affirmative and 
negative interrognti ves can bo neon in the morphology of the following verb 
and in the positioning of the particle §s. If the main verb is marked for 
person and number, then the cooccurnng es is affirmative: if the main verb is 
not marked for person and number, then the cooccurring es is negative. The 
reason for this is that the negative in Baltic Finnic is an auxiliary verb and 
takes the person and number marks while the main verb has a special complement 
form. Also, if affirmative es is used, then the particle is located in Second 
Position. But if negative es is used, the particle is always 
sentence-initial. This is because the negative verb counted ar. the 
sentence-initiai constituent to which 2P * -ko-s attached. Relevant examples 
are (30) and (31). Example (30) has finite main verb om Ms* (predecessor to 
modern Estonian on) rather than negative complement ole (as in (31)>. Tt 
further exhibits 2P es instead of initial es. 

(30) Old Estonian: (= (15)) 

Kumb es So Suurrcmb KaSk om Sadussen (Matt. 22: 36) 
Which 0 the larger order is law-Ill 

'Which is the greater command under/according to the. law? 1 

By contrast, the dialect example in (31) has the negative complement ole 'be' 
rather than main verb on 'is'. And in thir. example es is initial rather than 
second. 

(31) Poltsamaa dialect (SW Estonian) from Alvre (1976: 316): 
ed ta aTvem ole 

not-0 it chenper be 
'Isn't it cheaper' 

A, Re He. Forms 

Evidence for the declitic analysis of Estonian eg comes from the 
existence of relic forms in all the Baltic Finnic daughter languages. Shared 
relic forms indicate that the parent language had bound words rather than free 
words. Further evidence lies in the large number of relic forms in Estonian 
itself (not shnred with sister languages) showing that the sources for Old 
Estonian ej> and es were clitic before the Old Estonian period - pre-Estonian 
at the latest, most likely Proto -Baltic Finnic. 

4.1. Relic Forms in Baltic Finnic 

All of the BF daughter languages exhibit relit forms which indicate thnt 
Second Position enclisis is inherited from the parent language. These include 
emphasis word juba/jopa 'even', negative plus interrogative eks/eiks/eikos, 
and negative plus connective egas/e ikas . 

According to Kulima (1936), all the BF langudgos have words that 
correspond to Estonian Juba and Finnish jopa , both of which have an 
idiosyncratic, unpredictable meaning. The etymotuijif aJ sountj for iubii/jopa 
'even' is an old Germanic loanwoid ju, (Finnish jo) 'already' (Raun 1982: 21, 
i*f. Gothic Ju) plus *;mplmtiL toa. The result is not semantical ly 
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compositional 'even already' but a special emphasis word. Juba/.iopa has a 
typical syntax for a phonological word containing a Second Position Clitic; it 
can appear in other than initial position. 

(32) from Kal^ma (1S.6: 144): 
Finnish jo jopa 
Estonian juu juba 
Livonian ju juba, jub, j va 

All BF languages likewise show relics of combinations informal clitic 
*s. standard Finnish, for example, has en-ko-s 'don't I?', et-ko-s 'don't 
you?', ei-ko- s 'doesn't ?», etc. Colloquial Finnish has eiks 'doesn't ?'. 
Estonian has eks 'doesn't ?'. All of these come from the negative verb (stem 
in ez) and interrogative clitic *ko with optional informal clitic *s. The 
appearance of this form in Estonian is particularly surprising since it does 
not have the two clitics in question elsewhere. 

(33) a. Finnish 1SG en-k(b)-s -v 

2SG et-k(o)-s ) 

3SG ei-k(b')-s L NEG-Q-IF 

1PL emme'-k(b)"S C 

2PL ette-k(b)-s ) 

3PL eivat-k(o)-s J 

b. Estonian eks < ei/ep * ko + s 
NEG + Q IF 

Finally, the negative verb can combine with a former clitic conjunction 
*ka (see Nevis 1984) and informality marker *s. Finnish has en-ka-s 'and you 
don't', ei-ka-s 'and he/she/it doesn't*. Estonian has egas \md doesn't*. 
Again the presence of Estonian egas is unexpected here, since it displays (a 
relic of) clitic *s. 

(34) a. Finnish 1SG en-ka-s ~\ 

2SG et-ka-s / 
3SG ei-ka-s ( 

1PL emme-ka-s > NEG-CONJ-IF 
2PI* ette-ka-s \ 
3PL eivat-ka-s J 

b. Estonian egas < *ei/ep + ka + s 
NEG + CONJ + IF 

In summary, the complementarity of Estonian e^ and eg with the other BF 
languages* Wackernagel type clitics suggests a common origin. The relic forms 
juba/.iopa . eks , and egas indicate that the forms were once bound in BF. More 
evidence for the bound nature of these morphemes comes from strictly 
language-internal facts. 

4.2. Estonian relics 

Alvre (19H1) cites a number of Estonian relics forms in which bound s 
continues former clitic *s or clitic combination ♦ -ko-s . Bound s has an 
interrogative function hero, even if only redundantly in conjunction with some 
of the interrogative hosts — e.g. kuna 'when* — kuna-s 'when*. 
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(35) From Alvre (1981): 
kuna-s 'when' 
kuida-s 'how' 

palgu-St palju-ks 'how much' 
ammu-s, emrau-ks 'early?' 
ilma-ks 'free?' 
juba-ks, jooks 'already?' 
kaua-ks, kav*va-s 'far?' 
kaugele-ks 'far?' 

kuni-s 'up to what? as far as what?' 

mina-ks 'me?' 

sina-ks 'you?' 

mitu-ks 'how many?' 

muidu-ks 'otherwise?' 

nonda-ks 'like this?' thus?' 

on~ks, on-s 'is?' 

oli-ks 'was?' 

pea-ks 'has to?' 

veela-ks 'still?' 

vahe-ks, vaha-s 'few?' 

Dialectal forms include tuli-ks 'came?', rakkisi-ks-ma 'did I speak?', 
v gtsi-ks-roe 'did we take?'. anTsi-ks 'took?', miga-s 'what?', kurob-s 'which 
(of two)?', and ken-s 'who?' (Alvre 1981). 

There are also a number of -p - relics in Estonian. The list in (36) is 
taken from Ariste (1973: 36): 

(36) see'p see on 'that's that' (lit. that-EMP that is) 
kiillap 'certainly' 

siisap 'then' 
siiap 'hither' 
temap 'he/she' 
nondap 'thus' 
samap 'sane' 
sinap 'you (SG)' 
minap 'I' 
kustap 'whence' 
sealap 'there' 
annap 'give!' 
tulep 'come!' 

The ip-relics show greater variety in "host selection". j> attaches 
primarily to pronouns and adverbs, but can also be found connected to 
imperative verbs. 

Former clitics *o_a and *s cooccurred in some instunces, us revealed in 
relics £l>s 'doesn't?' e^ negative verb plus * -pa plus *^s, nondaps 'dann 
so' < nonda 'thus' plus » -pa~s . teps 'hinfort, von nun an' < *.js 'do!' plus 
» -pa-s . vastaps 'erst, soeben' < vasta 'just' plus ♦-pa-s (Alvre 1983). 

In some instances relics (k) s and _j> have acted prophylactically to 
retain an apocopated vowel. For example, interrogative veela-ks 'still?' 
retains older final a, but veel 'still' does not; it shows the effects of 
apocope. And kustap 'whence (EMP)' likewise retains older final a, while kust 
'whence' does not. In fact, a good number of Proto Baltic Finnic case 
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suffixes ended in a or a (according to vowel harmony). These final vowels 
were generally apocopated, unless a clitic such as _s or j> prevented apocope. 

(37) ELATIVE -st < -sta/-sta 

INSSIVE -s < -ssa/-ssa 

ABLATIVE -It \* -lta/-lta 

ABESSIVE -1 * 11n/»lla 



Non-initial a became e in Estonian. 



The result of apocope in Estonian was that the final vowel came to be 
reanalysed not as part of the stem, but as part of the ^ or ^ morpheme. 
Thus, when ♦kpj ta lost its fjnal vowel while ♦ kelta s retained the vowel, the 
result was a realignment of the vowel with respect to the morpheme boundary 
(see also Alvre 1981): 

(38) keltas > keltes > kelt 
kelta > kelt 



One would expect also as as a remnant of back vowel harmonic ^a (e.g. kusta-s 
> kust as ), but I assume that leveling was responsible for the spread of gs 
at the expense of as. A parallel reanalysis is necessary to explain the 
development of eg (section 5.1.). 

Tt is impossible to account for these relic forms in the cliticization 
approach — retention of a former morpheme- final vowel cannot be reconciled 
with language- speoi fic agglutination of an es or ep word. Instead, these 
relic forms demonstrate that independent es and eg used to be bound Morphemes, 
and thus decli t icization is required to account for their development. 



5. Clitic Development in Estonian and Finnish 



That eg and es arc independent words arising from phonological ly 
dependent words is clear from the preceding discussion. What remains to be 
accounted for is the semantic shift from informal ^§ to interrogative £S. 

On the basis of the Finnish and Estonian data, I propose a general 
account of possible and probable developments for Wackern age 1- type bound 
words. Old Estonian es and eg demonstrate that decliticization is one 
possible course of change. Old Estonian also shows loss of former 
interrogative clitic »-ko. Finnish -ko/-ko and -pa/ -pa reveal that clitics of 
this sort can be fairly stable as well. Affixation is also possible, though 
rare — Finnish informal is is one such example. 



5.1. Dec! it icization in Old Estonian 



Both es and eg were once phonological ly bound words. The two 
decli ticized at roughly the same time and in the same manner. When final 
vowels were apocopated circa 1250-1500 AD (Raun and Saareste 1966: 59, Kask 
1972: 155), clitics *^pa and ♦ (ko)-s acted prophylactically in preventing 
8pocope: 



(39) PROTO-BALTIC FINNIC OLD ESTONIAN after apocope 

a. *kelta v > kelt 
♦kelta-s v tkelta-s 

b. tpaalla N *paall 
#pna 1 1 a- pa > *paa 1 1 a *p 
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(Recall non-initial *a > e; note also diplithongization *aa > ea and 
certain degemi nations. ) 



Once the default instance is the apocopated stem and i.he less common 
instance is the vowel allomorph before a clitic, the situation is ripe for 
rei interpretation. The morpheme boundary is "moved", so to speak, such that 
the vowel is considered part of the clitic: 

(40) kelte--s > kelt-es 



And once vowel harmony is lost as a productive rule (circa 1650 AD — 
Raun and Saareste .1966: 65), there is no longer any evidence that -es and -ep 
are phonological ly dependent words rather than independent (though unstressed) 
words, i.e. quasi-enclitics. Since the rule that is responsible for 
phonological subordination consequently lacks motivation, it disappears from 
the language altogether, and the clitics are no longer "clitic" but 
independent. In other words, bound words are marked with respect to 
independent words, and I propose that decliticization here is an instance of 
the more general case in which marked become unmarked forms. 

Although one would expect both es, eg er.d as, ajg from the Proto-Baltic 
Finnic clitics, one finds only the e-vowel descendents. The two e/a 
alternates would be the continuations of former vowel harmonic alternates a/a 
as explained in section 4.2. T have been assuming that the e-stem allomorphs 
spread at the expense of the a-allomorphs (except in instances of 
lexical izat ion --- section 4.2). Leveling of this type is confirmed by cases 
where neither e^ nor a^ sources occurred. Examples (7-9, 11, 13, 17) above, 
have not had an intervene g vowel between the host and the clitic (or else had 
a different vowel). 

(7) Mis ep ... 
what EMP 

(8) Need ep . . . 
Ihese EMP 

(3) Siis ep 
then EMP 

Ml) .See ep ... 
that EMP 

(13) Niiu'd es ... 
now 0 

(17) Mink ka es . . . 
what also 0 

Thus, for example, Mi c , * ^7) does not reconstruct with :i vowel (i.e. not 
tmisa-pa"). These examples demonstrate the productivity and spread of gs and 
ej> at the evpense of the a-variants. 

When former *s ond *PJ failed to decliHuze in Estonian, they were 
1 OX i tali zed to specific lexeme and morpheme comb mat ions. Therefore the 



peale-p > peal-ep 
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productive forms ej> and es did not spread to every item. One finds a similar 
lexical! nation with Finnish informal ^s. 

5.2. Affixation in Finnish 

In Nevis (1985a) I argue that Finnish ^s is not a clitic but an affix. 
It is no longer productive insofar as it does not combine with just anything. 
It can be found in four . nations: it attaches to clitics -ko/-ko and . 
-pa/-pa (but not to clitics -han/-han 'you know, I wonder, by golly' or 
-kin/- kaan/kaan 'also, too, neither'), it attaches to interrogative pronouns 
kuka *who' and mika 'what' and relative pronoun joka 'who' (but not to other 
pronouns, e.g. se 'it, that' or roina 'T'), and it attaches to imperatives like 
otta-kaa-s 'please take' but not other verbal moods(e.g. ♦ otta-a-s 'takes'). 

This restricted distribution is uncliticlike. Affixes typically impose 
restrictions on host selection, but clitics do not — they are generally 
promiscuous in attachment (Zwicky 1977, 1984). All occurrences cf Finnish 
informal ^s can be accounted for by assuming that the s-forms are allomorphs 
of the non-s- forms. That is, tu le-pa-s (come-EMP-IF) is not trimorphemic, but 
bimorphemic tule-pas , where -pas is simply the "informal" allomorph of z£* 

Note that the two classes of phenomena in Finnish that allow informal 
are interrogative morphemes (interrogative pronouns plus their near cousins 
the relative pronouns, as well as the interrogative bound word -ko/-ko ) and 
euphatic morphemes (imparatives and emphatic clitic -pa/-pa ) . The connection 
to the former is revealing. It shows the crucial link between Finnish 
informal ^s and Old Estonian interrogative es. 

I propose that the primary source for Estonian es was precisely this 
lexicalized clitic » -ko-s > ^ks (> ^s > es). Interrogativity originates in 
the former Second Position clitic *ko, but through successive stages involving 
lexicalization of -ko-s and upgrading colloquial -k(o)s . the meaning is now 
continued in es. 

5.3. Clitic Loss 

Proto-Baltic Finnic interrogative clitic *kp. is now lost in Estonian. It 
played a role in the development of es, as described above, and it is found in 
relic forms eks .. ,vee1aks , and a few other relics (see (35) above). The loss 
of *ko probably came about through regular sound changes in the language. 
Apocope (circa 13th century — Raun and Saareste 1966: 63) would have dropped 
the final vowel, hence * ei-ko > * ei-k . and loss of final *n, k, h would have 
dropped the now- final consonant (Kask 1972: 155-156), hence ♦ ei-k "> el. The 
result is the awkward situation in which all interrogatives with former *kg 
become homophonous with declaratives. 

Interrogative * -ko and £§ (now archaic and dialectal) have been replaced 
by kas. Kas is positioned clause-initially and is of uncertain origin. Alvre 
(1983: 82) attributes to L. Kettunen the suggestion that kas came from the 
inperntive verb katso ! 'look!'. But bimorphemic ka-s with relic of former 
interrogative clitics * -ko-s is just as likely (see Alvre 1983). 

5.4. Wackernagel's Law 

Bound words that occur in Sentence Second Position are oftentimes stable. 
They rarely complete the agglutination cycle by becoming affixes. This is 
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because they are frequently incompatible semantically with the host. There is 
often no semantic or syntactic connection between the Wackernagel-type bound 
word and the initially positioned constituent. Affixation cannot be ruled out 
completely, as evidenced by Finnish informal ^s (section 5.3), but it seems to 
be one of the least likely developments of a Second Position clitic. 

If the Wackernagel-type clitic is not stable then it is either lost from 
the language (as with Proto-Baltic Finnic ♦ -ko in Estonian) or it becomes 
independent (as with Proto-Baltic Finnic and ♦-pa). Steele (1976) offers 
yet a different possible course of development — the Second Position clitic 
can turn into another kind of clitic. In several Uto-Aztecan languages, the 
Wackernagel-type clitic inherited from the parent attachment to the initially 
positioned constituent preceding it to the verb which followed it. Yaqui is 
an example: 

(41) ?inepo ne~?a-me?ak (Steele 1976: 554) 
I I -it-threw 

•I threw it' 

The former Second Position clitic pronoun ne no longer attaches to the first 
constituent in the sentence but to the following constituent, namely the 
verb. The former Second Position enclitic is now a verbal proclitic. 

Ard (1977, 1978) presents similar data from the developments in the 
Slavic languages, although clitichood of the morphemes in question is not 
established for sure (i.e. they are likely to be leaners, i.e. quasi-clitics, 
rather than bound words). Wackernagel-type words are found in Czech, Slovak, 
Slovenian, Serbo-Croatian, Transcarpathian Ukrainian, and in dialectally in 
Polish. Attraction to the verb has taken place in the other Slavic 
languages. In Russian, Belo-Russian, Ukrainian (except for transcarpathian), 
and dialectally in Polish the cognate morphemes have turned into suffixes on 
the verb; in Macedonian and Bulgarian they are located adjacent to the verb 
— after an imperative or participle, before n finite verb. Thus the 
alternative to Wackernagel's Law has been attraction to the head of the 
clause, namely the verb. 

6. Concluding Remarks 

Old Estonian es and eg evince the rare phenomenon of declit icization or 
loss, but also shows the possibility of a charge in the direction of 
attarhment. Tn an SVO langunge, the sandwiching of tli* " . ! P. ^1 lion 
clitics between the initial constituent and the verb permits the verb to exert 
a syntactic and semantic pull on the clitic group, so that they attach 
phonologically to the verb. Verbal clitics are more compatible with the host, 
and consequently are more likely to complete the agglutination cycle and less 
likely to decliticize. 



Notes 



♦This paper was composed at the Ohio Slate University, but completed at 
the University of Michigan. A general absence of available materials forces 
me to leave out. some potentially relevant Karelian, Vepsa, Votic and Livonian 
data. 
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Irrelevant grammatical information is left out from glosses. 
Abbreviations used in this article include: 

ABL ablative 

AD adessive 

AL allative 

CON.T conjunction 

COM comitative 

EL elative 

EMP emphatic 

HAN an episteraic clitic marking reintroduced 

information of current discourse relevance. 

IF informal 

ILL illative 

IN inessive 

PL plurai 

Q interrogative 

1. Jandn (personal communication) now informs me that he has given up 
one piece of his synchronic analysis, namely the claim that the Is morpheme is 
synchronically a determiner to the following NP. His diachronic analysis 
remains as before. 

2. Tauli cites Alfred Ludwig's (1873) article "Agglutination oder 
adaptation?", but I have not been able to locate that reference. 
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On Automatic and Simultaneous Syntactic Changes 3 
Brian D. Joseph 



0. Introduction 

Most linguists) in studying language change, have long assumed that 
there are changes which might well be described as being simultaneous! in 
that one change, D x , occurs at the same time as another change, Dy. In 
addition, it has also been assumed that there are changes which might be 
termed automatic, in that one change, D x , necessarily causes another 
change, Dy. In actuality, though, since the exact timing of changes is 
often hard to determine, it is generally the case that changes are 
counted as simultaneous if they at least appear to occur in close 
succession. 

It should be clear that not all simultaneous changes are linked in 
the causal relationship implied by the label "automatic". In particular, 
two changes — for example a change in the articulation of some sound and a 
reanal*'sis of a syntactic construction — may have nothing to do with one 
another yet may just happen to occur (virtually) at the same stage in a 
language's development. More often, probably, two changes that are 
simultaneous - ' or nearly so, to be more accurate — do stand in a 
cause-effect relationship, so that one change can be taken to be a 
consequence of the other change. Even in such cases, though, there need 
not be ar.y notion of necessity in the actuation of the second change, 

1. e. one does not have to be an automatic consequence of the other. Two 
examples from historical phonology demonstrate this difference well. 

Martinet, in several works (e.g. Martinet 1953), has argued for the 
existence of so-called "drag-chains" in sound change, in which one sound 
shift leaves a gap in a system but "drags" another sound along with it to 
fill that gap. For example, under one possible interpretation of the 
Grimm's Law consonant shift in pre-Germanic, the shift of the 
Proto-Indo-European voiceless unaspirated stops, e.g. *t, to voiceless 
fricatives, e.g. *&, left a gap in ths consonant system that was then 
filled by the Indo-European voiced unaspirated stops shifting to 
voiceless unaspirated stops, e.g. *d - — > *t. In such an account, the *t 

> *9 change dragged along the *d > *t change. While Martinet has 

in general viewed such a second shift as a .necessary consequence of the 
first, in actuality, sound systems tolerate many gaps happily, so the 
creation of such an imbalance in a system does not automatically occasion 
the filling of that gap through another sound shift. In such a case, 
then, two (virtually) simultaneous changes need not be causally linked. 

An example of an automatic change, though, is provided by the 
restructuring of underlying lexical representations brought on by 
unconditioned sound changes. For instance, when Indo-European *d became 
Germanic *t, lexical forms which had had *d were restructured so as to 
reflect the new pronunciation, as in the change of the word for 'ten': 
/♦dek'm/ — > /*texum/ (cf. Gothic talhun , English ten, etc., and note 
that there were other changes as well not relevant here). At the point 
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at which *d became tt, there was no longer any support for underlying /d/ 
either from morphophonemic alternations or even distributional evidence, 
ao the lexical fora— under any set of theoretical assumptions about how 
such forms are established by speakers learning their language— would 
have to change when *d changed. Thus the restructuring would have been 
(virtually) simultaneous with the sound change and an autoaatic 
consequence of it. 

This restructuring is an exaaple of an autoaatic change froa the 
reala of phonological change, but examples of autoaatic and siaultaneous 
changes have been proposed for syntactic change as well. However, the 
cases that have been proposed for autoaatic and siaultaneous syntactic 
changes are not without soae probleas. Accordingly, a brief review of 
seme of these attempts at uncovering this type of syntactic change is 
undertaken here, and then two case-studies are presented froa the history 
of Greek which provide stronger and aore convincing instances of 
autoaatic syntactic changes. 

It is important to point out, though, that it is very hard to prove 
conclusively that two changes are autoaatic or even that they are 
siaultaneous; this is a recurring problea in the evaluation of such 
exaaples. However, where one can find either no evidence to the contrary 
or else positive indications that the two changes were not separated by 
long periods of tiae, it can be assumed that two changes which appear to 
be (virtually) siaultaneous in fact are to be classified as such, for 
that assuaption allows for the possibility of interesting claias 
regarding the nature of syntactic change. 

Without the assuaption of soae kind of interaction between or among 
various changes, diachronic syntax becoaes little aore than a taxonomy of 
what changed between stage X and stage Y of a language; few, if any, 
interesting generalizations become possible about a theory of syntactic 
change, providing, for exaaple, a deliaitation of the range of possible 
changes in the syntax of a language. Therefore, wherever possible, the 
strongest position to take is that two apparently siaultaneous changes 
are in fact siaultaneous, for one can then work froa there to try to find 
an explanation for this siaultaneity. Accounting for one change in teras 
of another, by showing one to be an autoaatic consequence of the other, 
would be one way of providing such an explanation.! 

Furthermore, such explanations for syntactic changes, when 
available, can be used as a way of constructing arguments for or against 
particular theoretical stances, under the assumption that a synchronic 
theory provides the constraints on possible changes a language may 
undergo. 2 Such a position has been taken, for exaaple, by Lightfoot 
(1979a). He contends that Linguistic Theory should interact with a 
theory of change to pinpoint when grammars would undergo drastic 
reanalyses. One, can further claim, following the line of arguaentation 
being developed here, that a theory's ability to characterize one of two 
apparently siaultaneous changes as being in fact an autoaatic change, a 
necessary consequence of and thus explained in teras of the other, should 
likewise count as an important criterion upon which to judge competing 
theories of grammar. 
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In particular, in the examination of the putative automatic changes 
from the history of Greek, an argument is developed against a 
derivational framework for a theory of syntax and in favor of a 
nonderivational approach. Briefly, a derivational theory of syntax is 
one in which rules apply in a certain order lo produce a series of 
intermediate stages that convert a deep structure of a given sentence 
into a particular surface structure — the series of stages fomed by the 
output of rule applications is called a derivation. 3 In a nonderivatinal 
frajnework, by contrast, there is basically no difference between deep 
structures end surface structures and thus one is not converted into the 
other via a series of intermediate steps; instead, some notion such as 
the designation of levels at which syntactic generalizations can be 
stated (e.g. initial syntactic level, final syntactic level, some 
combination, etc.) — as in current versions of Relational Grammar and 
Arc-Pair Grammar 4 — or some division of labor into components— e.g. 
semantic as opposed to syntactic, as in Generalized Phrase Structure 
Grammar with its rule-to-rule semantics— is employed. The analogue to a 
derivational theory's step-by-step rules in a nonderivational theory is a 
set of well-formedness conditions holding on surface forms, where 
elements can appear, in what combinations, how they relate to other 
elements in the sentence, and so forth. In such a system, the notion of 
derivation has no place. 

Although comparing frameworks is a very tricky business, and 
perhaps not even possible, because the ground rules can be so very 
different in different frameworks, the two general approaches to syntax 
outlined here differ in one respect in the way they account for a 
particular change in Greek to be presented below. This distinction is 
discussed again in connection with that change after a look at some 
instances of automatic syntactic change that have been proposed in the 
literature. 

1. Some Previous Attempts at Finding Automatic. Syntactic Changes 

Among the instances cited as examples of automatic syntactic change 
(though not necessarily labelled as such) are the following two provided 
by Light foot in various studies. 

Lightfoot (1974, 1976, 1979a) has argued that a number of 
(nearly/virtually) simultaneous changes in the verbs which are now the 
Modern English modals ( can , could , may , etc.) were the automatic 
consequences of a single innovative restructuring of the base rules in 
16th century English. He claims that Old English and Middle English 
predecessors to the modals were real verbs, no different in any respect 
from other complement-taking verbs such as try or want , but that for a 
variety of reasons, they lost some verb-like features, e.g. no longer 
having full person and number paradigms, and were reanalyzed as forming a 
class distinct from that of try or want . 

Thus, from a set of phrase structure rules as (la) for Old md 
Middle English (which alternatively could have had the form in (lb)): 

1. a. OE/ME: S > NP AUX VP 

AUX > T(ense) 
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b. S — > NP VP 

VP > V + T (NP . . . ) 

Lightfoot claims that 16th century Bnglish innovated a new expansion for 
AUX, creating a new deep structure category of modals (= M), and giving 
the rules in (2): ' 86 

2. " S > NP AUX VP 

AUX — > T (M) 
M — > can , may 

Furthermore, he claims that this new phrase structure rule, this single 
innovation, triggered the four changes listed below in (3) as automatic 
consequences: 



3. a. no 'more infinitival forms of modals 

b. no more gerund (zing) forms of modals 

c. only one modal per (simplex) sentence (in standard language) 

d. no more have + MODAL + en combinations. 

The absence of infinitival forms, for example, follows necessarily 
because the new modal class only occurred as a tt sister M of T(ense) in the 
AUX node, a place where infinitives could not occur; similarly, the other 
changes in the modals are a necessary result of the nature of this new 
phrase structure rule. 



In another work (1979b), Lightfoot proposes yet another instance of 
an automatic syntactic change, this time in the English passive, and this 
time the result of the addition to the grammar of a single 
transformational rule of NP Preposing. This rule led to the existence of 
a transformational rule of Passive whereas prior to the 16th century 
Lightfoot claims, English had only a lexical passive rule. He is ' 
assuming a theory with a rigid distinction between lexical and 
transformational (or syntactic) rules, and argues that the properties of 
a syntactic rule as opposed to a lexical rule of Passive led to at least 
three automatic an* simultaneous changes in English passives; in 
particular, three new passive sentence-patterns, Hated below in (4). 
become possible: 

4. a. passives with underlying indirect objects promoted (e.g. 
John was given a book ) 

b. "prepositional" passives (e.g. The terms were agreed upon ) 

c. passives with NP-subjects that do not bear a semantic 

relation to the »ain verb (e.g. John was expected to win ). 

These changes would have occurred automatically, Lightfoot claims, 
because the new transformational (syntactic) rule of Passive could move 
any NP after the verb to subject position, whereas, according to the 
properties of lexical rules the theory specifies, the lexical rule could 
only relate an active direct object with a passive subject. The sentence 
patterns illustrated in (4), then, could only arise with the advent of a 
transformational rule of Passive, so that these patterns are an automatic 
consequence, in Lightfoot's account, of the addition of such a rule to 
the grammar of English. 
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Despite these neat-looking examples of (virtually) simultaneous and 
automatic syntactic changes, Lightfoot's analyses are not uncontroversial 
and do not provide unqualified examples of this type of syntactic 
change. In particular, in each instance, one has to wonder whether 
Lightfoot has correctly identified cause and effect, i.e. is it really 
the case for the modals that the disappearance of the infinitival and 
gerund forms was the result of a base restructuring, or is it not 
possible that these forms were lost and only then was the grammar 
restructured to have a modal category to accomodate these now anomalous 
verbs? Furthermore, since other verbs do not have a full set of 
nonfinite form* — for some speakers, it seems that the verb stride does 
not have a past participle, with neither has strode , has stroden , has 
stridden , has strided , nor has stridded sounding acceptable 0 — the modals 
may just be a special case of the loss of nonfinite forms being 
generalized throughout a semantic class such as that formed by the modals. 

Finally, exception can be taken to some aspects of Lightfoot's 
data. For example, regarding the modals, Lightfoot takes as significant 
the fact that the Oxford English Dictionary gives the last example of a 
modal infinitival or gerund form as occurring in the 16th century, even 
though the mere occurrence of a form in a text does not guarantee that it 
is still in current use (some texts are consciously archaizing, for 
example). Moreover, the process by which the verbs that ended up as the 
Modern English modals became specialized in their modal function and 
syntax was actually a very gradual change — for example, the gerund forms 
were rather rare at all prior stages of English 6 — and does not really 
display the suddenness that Lightfoot suggests. Thus, until the crucial 
examples that Lightfoot cites in support of his claims of simultaneity 
for these changes in question are subjected to careful philological 
scrutiny, his analysis has to remain tentative. 

Moreover, Lieber (1979) has suggested that the factual basis for 
Lightfoot's claims about changes in English passive sentence patterns is 
faulty, for she finds in Old English passive sentences of the type 
Lightfoot says first appeared only in the 16th century. She concludes 
that Old English had a transformational (syntactic) rule of passive, as 
well as a lexical rule, and that the changes that in Lightfoot's account 
were simultaneous and automatic consequences of the addition of a rule of 
NP Preposing were features of Passive that were already present in the 
language. Such a finding, of course, if valid, renders this example of 
automatic syntactic change nothing more than a mirage. 7 

Besides these putative automatic, simultaneous changes due to 
restructuring or rule addition, there are also examples in the literature 
which invoke language universals, and claim that a change D x 
automatically triggers another change Dy because brings on a situation 
in which some universal is "activated", so to speak, and satisfying that 
universal requires the further change Dy. In such an instance, D y is an 
automatic consequence of D x and by the definitions adopted earlier, is 
simultaneous (or virtually so) with D x . This type of explanation is 
evident in most of the work done recently on word-order change, in which 
putative universal correlations as in (5) have been called upon to 
explain, for example, a shift from postpositions to prepositions in Greek 
along with a shift in basic word order : B 
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5. OV with Postpositions / VO with Prepositions. 

/irv* Another ^^P 16 involving universals is one described in Joseph 
(1978, 1980) concerning what happened to Greek Object Raising and Object 
Deletion sentences, i.e. constructions analogous to the English sentences 
in (6): y 

6. a. Object Raising: John is easy to please. 

b. Object Deletion: Mary is pretly to look at. 

when affected by the Greek replacement of infinitives by finite verbs. 10 
In Greek, from Ancient Greek up to early Medieval Greek, there were 
Object Raising and Object Deletion sentences with a form entirely 
analogous to that found in Bnglish, with an infinitive in the complement 
clause and a zero-object (i.e. a missing object argument) with that 
infinitive on the surface. 11 Examples for pre-Modern Greek of Object 
Raising are given in (7) and of Object Deletion in (8), with a 0 
marking the Biasing object argument : 12 

7. a. he: ergasia mathein 0 ... hraiste: edokei einai 

the-work/FEM.NOM learn/INF easiest/FBM.NOM seemed be/IVF 
'The work seemed to be easiest to learn 1 (Xen. Oec. 6.9) 
b. tragoudousin to paranomon horo.'sai 0 muste:rion 
sing/3PL the-illegal/NTR see/INF rite/NTR 
•They sing of the rite (which is) illegal 

to see* (Spanos 26 (12th century)) 

8. a. kai ... horan 0 stugnos e:n (Xen. Anab . 2.6.9) 

and see/INF gloomy/NOM was/3SG 
'And he was gloomy to look at* 
b. tous ... khrusinous hetoimous ekhei tou dounai 0 
the-gold-pieces/ACC ready/ ACC. PL has/3SG PRT give/INF 
'He has the gold pieces ready to give 

over' (Doukas 1164A, 13-14 (15th century)). 

When the infinitive was replaced by a finite (i.e. perron-narked and 
tensed) verb in late Medieval Greek, sentences corresponding to (7) and 
(8) continue in the language, but in a slightly different form; in 
particular, the late Medieval and Modern Greek continuation of the 
earlier Greek constructions now have a pronominal object in the 
complement clause that corresponds to the matrix subject. This is shown 
by the Modern Greek sentences in (9), where ta in (9a) and tin in l9b) 
are obligatory object pronouns in the complement clause: 13 

9. a. ta anglika± ine diskola na 

the-English/NTR.PL are dif ficult/NTR.PL PRT 

taj katalavo 
them/NTR.pL understand/lSG 

'English is difficult for me to understand 1 (literally: 
"The English sre difficult that I understand them") 
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b. i maria^ ine omorfi na tin; kitazis 

Mary/NOM is pretty/NOM.FEM PRT her/ACC look-at/2SG 

'Mary is pretty to look at' (literally: "Mary is pretty 
that you look at her"). 



The change from a zero-object in the complement clause to an obligatory 
object pronoun can be taken as an automatic consequence of the change in 
the infinitive, i.e. in the nature of the complement clause verb itself, 
because of a putative universal constraint on Object Raising and Object 
Deletion constructions given in (10):^ 

10. Object Raising and Object Deletion cannot deprive a finite 

complement verb of its object. 

This constraint is observed in several languages, incloding French, 
Spanish, German, Albanian, Irish, Korean, Mongolian, ant) Arabic (see 
Joseph (1978, 1980) for some discussion) and can account for .the 
contrasts in (11) between unacceptable English Object Raisi^sf/Object 
Deletion sentences with finite complements versus acceptable oes with 
nonfinite complements: 

11. a. *John would be difficult for me to imagine (that) I might 



invite 0 to my party 

b. John would be difficult for me to imagine inviting 0 to 

my party 

c. tMelina is too ugly for us to be able to convince John that 

he should kiss 0 

d. Melina is too ugly for us to be able to convince John to 

kiss 0. 



If this universal is valid, the change in the possibility of a 
zero-object as opposed to a pronominal object occurring in the complement 
clause of Object Raising and Object Deletion sentences would have been a 
necessary change, given the change in the type of complementation from 
nonfinite to finite. 

A problem, though, with this account, and for that matter with iaty 
account making use of a universal, is that all universals are putative at 
best, subject to verification again and again as new data is brought to 
light, but therefore liable to be counter-exemplified by some piece of 
data not previously considered. For example, the potential Object 
Raising/Object Deletion universal given in (10) runs into some weak but 
nonetheless real counterexamples in English sentences such as (12): 

12. a. ?A book like that is tough to claim that you've read 0 
carefully. 

b. ?This rock is too heavy to claim that I can pick up 0. 

While not fully acceptable sentences, nonetheless, those in (12) are not 
as bad as (1) predicts. Thus this universal has some validity, for the 
sentences in (12) are not wholly well-formed, but it cannot be an 
absolute universal. As a result, in a Medieval Greek Object Raising or 
Object Deletion sentence with a finite complement and a missing object 
with that finite verb, there would not have been any necessity for an 
object pronoun to arise, even if this might have been a likely or even 
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preferred development. 

Si»ilr.rly, many of the word order universale are of a statistical 
nature only, and meet with counterexamples (e.g. Papago and Persian 
appear to he exceptions to the correlations noted in (5)).* 5 That being 
the case, one change would not necessarily force a second word-order 
change, at least as far as the correlation in (5) predicts. 16 

This cursory review of prior attempts to establish instances of 
automatic syntactic change shows that in order to get a good, i.e. 
relatively safe and unassailable, example of such a change, one needs 
cases in which either the data is clear or, if a universal is involved, 
it is one that is not controversial and can be supported by a wide range 
of relevant data. At this point, it is appropriate to exanine two 
changes from the history of Greek which meet these requirements in order 
to demonstrate hot only that automatic syntactic changes exist but also 
that they can be used in arguing for particular theoretical frameworks. 

2. Copy-Raising in Greek 

The two changes to be examined both involve and depend on a 
construction which can be referred to as "Copy-Raising" . In order, then, 
to understand these changes properly, some background on this 
construction is needed. The Copy- Railing construction is one in which a 
nominal originating in— i.e. semantical ly linked (in initial structure) 
to — a complement clause appears superficially in a higher clause but 
shows an overt marker — in the form of a "copy" pronoun — of its presence 
in its "point of origin" (i.e. the lower clause). English sentences with 
the matrix predicate look like , as in (13a), heve often been cited as 
examples of such a construction (the non-Raised version is given in 
(13b)): 

13. a. Bill} looks like he± is ready to leave, 
b. It looks like Bill is ready to leave. 

(13a), under such an analysis, would show a nominal ( Bill ) raised to 
subject status in the aatrix clause with a copy (he^) left in the 
complement clause. ' 

This Copy-Raising construction is found in Classical and 
Hellenistic Greek, and has been studied in this context by Marlett 1976. 
Some examples are given in (14): 

14. a. te:n ... huperbole:n to:n oreo:n ededoikesan 

the-pass/ACC the-mountains/GRN feared/3PL 

me: prokatale: phtheie (Xen. Anab. 3.5.18) 

lest be-occucpied/3SG 

'They were afraid that the mountain pass might be 
occupied* (literally: "They feared the mountain pass 
lest it be occupied") 
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b. epegino:skon de auton hoti houtos e:n 
knew/3PL and him/ACC COMP this/NOM.MAGC was/3SG 

ho kathe:menos (;icts 3:10) 

t he-si 1 1 ing/NOM. MASC . PPL 

'And they recognized that he was the one sitting* (lit.: 
"They recognized him that he was the sitting one") 

c. egno:n se hoti skle:ros anthro:pos ei 
knew/lSG you/ACC COMP hard/NOM.SG man/NOM.SG are/2SG 

'I knew that you were a hard man' (literally: "I knew 
you that you were a hard nan") (Matt. 25:24) 

Note that the copy is not always overtly present; because Greek has 
always been a language that suppresses unemphatic subject pronouns* a 
copy pronoun having subject status — as in (14a) and (14c) — does not have 
to appear on the surface. Marlett's analysis of these sentences as 
involving a Greek version of Copy-Raising, though admittedly not 
supported by hard evidence — such as demonstrably ungrammatical sentences 
and native speaker judgments — that is necessarily lacking for a "corpus 
language" such as Ancient or Hellenistic Greek, nonetheless can be 
adopted for two reasons. First, apparently synonymous non-Raising 
versions of such sentences can be shown to occur, and second, the logical 
structure of the predicates involved seems to point to a raising analysis 
(e.g. FEAR is a two-place predicate so that a sentence such as (14a) with 
a subject, a direct object, and a complement clause must not represent a 
basic structure with this verb). 

Modern Greek also has a Copy Raising construction. Although it is 
not all that common a construction and is restricted to just a handful of 
verbs, nonetheless it is a construction-type in the language and so must 
be part of any descriptively adequate grammar that might be constructed 
for the language. ^ An example of this construction is given in (15): 

15. Ceoro tin maria pos mono afti ine eksipni 
consider/lSG Mary/ACC COMP only she/NOM is/3SG smart /NOM. FEM 

'I consider only Mary to be smart* (literally: "I consider 
Mary that only she is smart") 

and it can be contrasted with a synonymous non-Raised version as in (16): 

16. Georo pos mono i maria ine eksipni 
consider/lSG COMP only Mary/NOM is/3SG smart/FEM 

'I consider only Mary to be smart* (literally: "I consider 
that only Mary is smart"). 

For Modern Greek Copy Raising sentences, the usual range of arguments 
based on selectional restrictions, idiom chunks, active/passive synonymy, 
Reflexivization, and Passivization are all available to show that the 
accusative noun phrase in such sentences corresponding to tin maria in 
(15) is in fact a nain clause direct object on the surface and moreover 
corresponds to an initial-structure complement clause subject. 18 Most 
common ly in Modern Greek, as in earlier stages of the language, these 
Copy Raising sentences have no overt copy pronoun in the complement 
clause, for unemphatic subject pronouns generally do not appear on the 
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surface in Modern Greek; the occurrence of mono 'only' in (15) provides 
the emphasis necessary for the occurrence of the pronominal copy afti. 

With these preliminaries concerning the nature of the Copy Raising 
construction out of the way, a ascription and account of the changes in 
this construction between (early) Post-Classical Greek and Modern Greek 
can be developed. The first change concerns a change in the form, 
specifically the person, of the complement clause copy pronoun in one 
subclass of Copy Raising sentences. 

3« The Chang e in Complement Clause Agreement 

In Copy Raising sentences in earlier stages of Greek, the raised 
nominal and the copy always were of the same grammatical person, as shown 
by the examples given earlier; for example, in (14b), both auton, the 
raised nominal, and hputps, the copy pronoun, are third person. 
Moreover, in sentences in which the raised nominal was coreferent with 
the matrix clause and therefore of the same person as the matrix subject, 
it appeared on the surface as the reflexive pronominal form; in such a 
sentence, then, the complement-clause copy likewise was of the same 
person as the matrix clause subject (and object). An example of such a 
Copy Raising cum Reflexivization sentence is given in (17), where the 
main clause subject—understood to be the second person plural form 
humeis (absent on the surface because it is unemphatic in this 
sentence)--the reflexive form heautous and the raising copy humin are all 
second person forms: 

17. ouk epiginotskete heautous hoti ie:sous 
not know/2PL yourselves/ACC COMP Jesus/NOM 

khristos en humin (2Cor.« 13*5) 

Christ/NOM in you/DAT. PL 

'Do you not understand that Jesus Christ is within you? 1 
(literally: "Do you not understand yourselves that 
Jesus Christ (is) within you?") 

In Modern Greek Copy Raising sentences with Reflexiviation in the 
matrix clause, though, a difference from the earlier pattern exemplified 
in (17) is evident. In particular, the raised nominal (the reflexive 
form) and the complement clause copy still agree in person (as in (14) 
and in (17)), but they are both different in peraon from the matrix 
subject. Such a sentence is given in (18), where the matrix clause 
subject is first person (suppressed as unemphatic, but understood to be 
ego I/NOM'), while the reflexive form ( ton eafton mu ) is third person 
and the raising copy ( aftos — -in parentheses since it is overt only if 
emphatic) is also third person; 

18. den 6a afiso ton eafto mu na petixi (aftos) 

not FUT let/lSG the-self/ACC my PRT succeed/3SG he/NOM 
»I won't let myself succeed' (literally: "I won't let the 
self of me that he succeed"). 

Bven though, as noted above, the raising copy can be ahsent on the 
surface if, h* is most usually the case, it is unemphatic, the third 



44 




- 38 - 



person verb agreement in the complement clause gives an indication of 
what person its subject is understood to be, and shows that it is 
different from that found in the matrix clause (as shown by the matrix 
clause first person agreement). There is thus a syntactic difference in 
Copy Raising sentences between Hellenistic Greek and Modern Greek in the 
person of the raising copy (and thus in the verb agreement in the 
complement clause when the copy is the subject) when the raised nominal 
is subject to Reflexivization in the main clause. It is this syntactic 
difference which demands an explanation. 

One fact which is relevant for an explanation of this change is 
that the reflexive form itself has a different character in Modern Greek 
from that found in earlier stages of the language. In particular, in 
Ancient Greek and Hellenistic Greek, the reflexive was a pronominal form, 
agreeing in person with its antecedent and showing different forms for 
the different persons; a portion (accusative case only) of the paradigm 
for the Ancient Greek reflexive pronoun is given in (19): 



19. 1SG emauton (MASC) 

2SG seauton 

3SG heauton 

1PL he: mas autous 

2PL humas autous 

3PL heautous 



emaute:n (FEM) 
seaute:n 
heaute:n 
he: mas autas 
humas autas 
heautas 



heauto (NTR) 



heauta (NTR) 



and the Hellenistic paradigm was similar. ^ In Modern Greek, though, the 
reflexive is itself a fixed form, consisting of ton eafto , literally "the 
self", with a possessive pronoun; it thus has the form of a possessed 
nominal, 20 being literally "the self" with a possessive pronoun and thus 
structurally parallel to ton adelfo 'brother' with a possessive 
pronoun. The reflexive form, therefore, is syntactically a third person 
nominal, just as is any nonpronominal noun phrase. The possessive 
pronoun is the only thing in the reflexive form that necessarily 21 shows 
agreement with the reflexive antecedent. A partial (accusative only) 
paradigm of the reflexive form is given in (20), along with the possessed 
form of 'brother' for comparison: 



20. 1SG ton eafto mu 1PL ton eafto mas 
2Sn ton eafto su 2 PL ton eafto sas 

3SG ton eafto tu (h/N) 3 PL ton eafto tus (M/N/F) 

ton eafto tis (F) 

cf. ton adelfo mu/su/tu/tis/mas/sas/tus 

'my/your/his, its/her/our/your/their brother'. 

This change in the form of the reflexive took place at least as early as 
the 12th century; an example from the poems of Glykas is given in (21): 

21. na pnikso: ton heauton mou (Glykas 288 (12th cent.)) 
FUT drown/lSG the-self/ACC my 

'I will drown myself. 



The. has thus actually been a multiple change in the form of 
Raising cum Reflexive sentences between earlier stages of Greek and 
Modern Greek. The reflexive form and the copy found in Raising cum 
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Reflexive sentences have both changed so as to differ from the Matrix 
subject (the reflexive antecedent) in person. It seems, therefore, that 
two (virtually) simultaneous changes have occurred, and Moreover, it can 
be shown that this set of changes provides an excellent candidate for 
consideration as an automatic syntactic change. In particular, it seems 
that the change in the form of the complement clause copy pronoun in such 
raising sentences is an automatic consequence of the change in the form 
of the reflexive. 

The motivation behind the necessary change of the copy to third 
person along with the change of the reflexive to third person status is 
the following universal: 

22. A copy must agree in all relevant features (i.e. relevant to 
the language in question, e.g. person and number) with the 
nominal of which it is a copy. 

It is safe to say that most linguists would agree that (22) is a fairly 
uncontroversial universal, one for which it would be very hard to find 
counterexamples. For instance, one class of apparent counterexamples, 
namely sentences such as (23): 

23. John, I can't stand the idiot 

is probably best analyzed as involving dangling topics rather than a 
nonagreeing copy. Furthermore, the need for such a universal independent 
of the Greek Raising cum Reflexive sentences under consideration is shown 
by copy /antecedent agreement in a variety of constructions in a variety 
of languages, including English look-copy sentences (cf. (13) above) and 
Left-Dislocation sentences, as in (24): 

24. a. The scissorsj look like theyi/#it need to be sharpened, 
b. John, I can't stand him/*her/#it. 

If this universal is valid, then a ready explanatin is provided for 
the apparent automatic nature of the syntactic change in question here. 
When the reflexive changed in person, the universal would guarantee that 
the copy pronoun in the complement clause — inasmuch as it is a copy of 
the nominal that surfaces as the reflexive—would also change. 

The only potential problem with this account—and thus with taking 
this change as a true instance of an automatic syntactic change triggered 
by a universal— is the fact that it cannot be conclusively demonstrated 
that the change in the complement clause copy was simultaneous with the 
change in the reflexive. Unfortunately, as is so often the case in such 
investigations, there is not very much historical data to back up the 
claim. However, in this instance, there is at least some. The reflexive 
change, as noted above (cf. (21)), took place no later than the 12th 
century. The first Raising cum Reflexive sentence, though, that occurs 
in Greek after the reflexive change dates from the late 16th century, 
from the Cretan comedy Katzourmbos : 
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25. thelo: aphe:sei ton emauto mou na ... 
want/lSG let/INF the-self/ACC my PRT 

ksale: smone: se: (ton nou mou) ( Katz . II. 173-4) 

forget/3SG the-mind/ACC my 

'I won't let nyself forget my mind'. 

In interpreting this sentence as a relevant example for the discussion 
here, it is necessary to take the third person singular verb agreement 
exhibited by the complement verb ksale:smone:se: , as indicating that the 
complement clause subject — absent on the surface because of unemphatic 
subject pronoun drop in Greek — was third person. What makes this 
aentence valuable, despite the four centuries between it and the 
reflexive change is the fact that no counterexamples, at least, with 
agreement patterns different from those in (25), are to be found. 
Sentences such as these, by their very nature, might well be expected to 
be quite rare, 22 So that despite the scarcity of conclusive historical 
data, there is really nothing standing in the way of taking (25) and its 
kind to represent a change that occurred concomitantly with the changes 
in the reflexive evidenced in (21). This consideration and the 
strong — and generally unassailable — universal in (22) that has been 
proposed to be at work in the change combine to make this a very good 
candidate for a real example of an automatic syntactic change. 

4. Sneaky Passives Diachronically 

Another aspect of the Copy Raising construction provides another 
instance of a syntactic change which, though lacking in some of the 
crucial historical data, nonetheless seems to be a real instance of an 
automatic syntactic change. In this case, however, there are some 
interesting theoretical dividends concerning differences between 
derivational versus nonderivational frameworks that can be reaped from 
the account of the change. 

This change concerns the status of Copy Raising sentences in which 
the complement clause is passive and the raised nominal corresponds to 
the agent ive noun phrase in the complement clause. An example of such a 
sentence from Ancient Greek is given in (26): 

26. dedoik' emauton ... me: poll' agan 
fear/lSG myself/ACC not much/NOM.PL.NTR too 

eirermena e: moi (Oed._ Tyr. 767) 

said/PASS. PPL.NOM.PL. NTR be/3SG me/DAT 

'I fear that too much has been said by me'. 

Such sentences can be referred to as "Sneaky Passives", following 
Perlmutter & Soaraes (1979: lb4ff. ). because in n derivationnl framework, 
these can be derived by applying Copy Raising in the matrix clause and 
then applying Passive "sneakily" into the complement clause; this latter 
step is possible because Copy Raising ) eaves a fully- intact complement 
clause, complete with subject (the copy pronoun) nnd object, and thus 
meeting the structural requirements for application of a passive rule. 
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This derivation is sketched in (27): 

27. UNDERLYING STRUCTURE: S [I f ea r S [I say too much] s ] s 



COPY RAISING : [I f ea r myself [I say too much]] 

(+ REFLEXIVE) JJ 

PASSIVE ("SNEAKILY"): [I fear myself [too much be said by me]] ' 
where many details of structure have deliberately been left out. 

/ocn 11 ^ Passives we™ full y grammatical in earlier stages of Greek, 
as (26) indicates. It is important to note that Copy Raising could in 
general operate on oblique nominals, as shown by examples (17) above and 
(28) below: 



28. a. phoboumai humas me: eike: kekopiaka eis humas 

fear/lSG you/ACC.PL not in-vain worke4/lSG among you/ACC.PL 
'I fear that in vain have I spent my labor among you* 
(literally: "I fear you lest I have spent my labor 
among you in vain") (Gal. 4 ; n) 

b. kai poiei pantasj ... hina do:sin autois* kharagma 
and cause/3SG all/ACC.PL COMP give/3PL them/DAT mark/ACC 
'And he caused everyone ... to receive a mark* 
(literally: "He caused everyone* that theyj give to 
thenii a mark") J ( Rev . 13; 16 ) 

The fact that oblique nominals were eligible for Copy Raising means that 
a Sneaky Passive" sentence such as (26) actually has two possible 
derivations. Besides the one outlined in (27), there is also a 
derivation in which passive applies in the complement clause followed by 
Copy Raising of the agent nominal created by passive into the higher 
clause. It is significant that (28b) shows Copy Raising of a dative noun 
phrase (autois), since it shows that even if the eligibility conditions 
for Copy Raising were stated in terms of case-marking, such a derivation 
for (26)— where the nominal corresponding to the raised noun phrase in 
the complement clause is a dative, moi— cannot be ruled out. It can be 
concluded, then, that a sentence such as (26) indeed has two possible 
derivations— the "Sneaky" Passive derivation and the one in which first 
Passive applies in the lower clause and then Copy Raising in the higher 
clause. 



Sometime between Ancient Greek and Modern Greek, and unfortunately 
there is absolutely no textual evidence to indicate when this may have 
occurred, a change took place in Copy Raising. Whereas in Ancient Greek, 
it seems that any nominal could be raised to object status in a higher 
clause (cf. the raising of obliques in (28)), in Modern Greek Copy 
Raising is restricted to operating only on subjects." Thus, sentences 
such as (29) are ungrammatical: 

29. *Georo ton yani pos ton vrika iliOio 

consider/lSG John/ACC COMP hia/ACC found/lSG stupid/ACC 
'I consider John that I found him (to be) stupid 1 . 
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Since there is no indication regarding the status of such sentences in 
Medieval Greek, the assumption can be nnde that the Modern Greek evidence 
shows that the change in which nominals can be raised via Copy Raising 
has taken place within the Modern Greek period, i.e. in what is roughly 
the contemporary language. 

What i? interesting here is that besides this change in the 
eligibility of nominals for Copy Raising, there is another change, namely 
a change in the status of Sneaky Passive sentences. In Modern Greek, 
Sneaky Passive sentences are ungrammatical, as shown by (30): 

30. *0eoro ton yani pos i maria vlaftike apo afton 

consider/lSG John/ACC COMP Mary/NOM hurt/3SG.PASS by him/ACC 
**I consider John that Mary was hurt by him'. 

Thus it appears that this is another instance of a simultaneous change — a 
change in what can raise occurs together with a change in the status of 
Sneaky Passives. Although the mere apparent (or stipulated) simultaneity 
of the two changes does not mean that one is an automatic consequence of 
the other, as noted above in section 1, it is desirable to treat them as 
having that relationship, for then one change can be explained in terms 
of the other. Thus, following that line of reasoning, an account is 
developed below in which the change in Sneaky Passives is an automatic 
consequence of the change in the Copy Raising construction; any such 
account necessarily is stronger — and thus more interesting and 
preferable — to one in which the changes are unrelated (again, as noted in 
section 1). 

In a nonderivational framework, as pointed out in section 1, there 
are no syntactic rules that work to convert a deep structure into a 
surface structure via a series of phrase-markers (or the equivalent). 
Instead, sentences are generated in their surface forms in accordance 
with the analogue of syntactic (transformational) rules, namely 
well-formedness conditions on these surface strings. The analogue of 
Subject-to-Object Raising, for instance, would be a well-formedness 
condition sanctioning the occurrence of a nominal that is semantical ly 
"relevant" (to use as theory neutral a term as possible) only in a lower 
clause (i.e. it bears a logical relation only in that clause) as an 
object in a higher clause. Put in terms of a nonderivational framework 
with a recognition of syntactic levels and grammatical relations, such as 
Arc Pair Grammar or Relational Grammar, such a condition would allow an 
initial level complement clause nominal, e.g. a subject, to occur as a 
final level matrix clause object. 

In such a framework, Ancient Greek Copy Raising would be a 
well-formedness condition such that a nominal bearing any final level 
grammatical relation in the complement clause may "legally" be the matrix 
clause object. This condition is stated in slightly more formal terms in 
(31): 

31. Condition on Ancient Greek Copy Raising: 

The final complement GR^ (= any grammatical relation) 
is the final matrix clause GR2 (= direct object). 
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In other words, (31) allows the occurrence of Copy Raising sentences in 
which the final matrix object bears any final level grammatical relation 
in the complement clause. 

A condition such as this sanctions. Sneaky Passives such as (26), 
repeated here for convenience: 



because the matrix object emautpjr-which does not bear a logical relation 
to the matrix verb—bears some final grammatical relation in the 
complement clause, namely the relation of the agent in a passive clause 
(the "Chtaeur" relation of Relational Grammar, the 8-relation of Postal's 
Arc Pair Grammar). The well-formedness condition states only that this 
nominal oust bear some relation in the lower clause; it does not restrict 
which relation this »ight be, so that passive agent meets the 
requirements of the condition. 

In Modem Greek, though, the well-formedness condition for Copy 
Raising sentences has changed so that instead of being able to be any 
complement clause grammatical relation, the matrix object can only be the 
final complement clause subject (as noted earlier— recall example (29)). 
The Modern Greek version of this condition is given in (32); 

32. Condition on Modern Greek Copy Raising: 



This differs from (31) just in the specification of GRj as opposed to 
OR*, that is, a relatively minor change from a formal standpoint. 
However, it is a change that has important consequences. In particular, 
(32) automatically rules out Sneaky Passives because in Sneaky Passives, 
the matrix object in a Copy Raising is not the final complement clause 
subject, but rather is the final passive agent (i.e. chomeur or 
8-relation) . 

There are admittedly a few potential problems with this account. 
In particular, since in Greek Raising there is a copy of the raised 
nominal in the lower clause, is it the copy or the matrix object that is 
considered to bear the relevant grammatical relation in the lower 
clause? It* may be necessary to stipulate something to the effect that a 
copy counts the same as the form of which it is a copy in terms of 
satisfying the well-formedness conditions and the "is" relationship 
utilized in the Raising well-formedness conditions (i.e. in (31) and 
(32)). Alternatively, the existence of a chain of "control" linking the 
matrix object with its copy in the lower clause may be sufficient. This 
particular problem, however, is not a problem just for Copy Raising but 



26. dedoik' emauton ... me: poll* agan 
fear/lSG myself/ACC not much/NOM. PL . NTR too 




( Oed. Tyr. 767) 



The final complement GR\ (= subject) is the final matrix 
clause GHfc (= direct object). 
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rather is a general problem of determining how the overall syntactic 
framework should treat copies of nominal s that bear particular 
grammatical relations. 

Still, in this framework, the problem posed by the apparent 
simultaneity of these two changes — the change in what can raise and the 
change in the status of Sneaky Passives — finds a straightforward 
solution, for the change from (31) to (32) automatically triggers the 
change in the acceptability of Sneaky Passives. The ease with which the 
nonderivational framework can account for these two changes is striking, 
for it contrasts with the extra machinery and extra assumptions needed in 
a derivational account of the changes. 

For convenience in exposition, a derivational framework is assumed 
here in which rules are stated in terms of grammatical relations; this 
decision allows for easier comparison with the nondeirvational account 
outlined above, for rules in that account are so stated. The main 
assumption behind a derivational approach is that syntactic rules convert 
an initial structure into a surface structure through a series of 
sequentially-ordered steps (phrase markers, in the terminology of 
transformational grammroar). Raising, therefore, in such a framework is a 
"process" by which a nominal in a lower clause becomes the object in a 
higher clause. For the sentence-type under consideration, i.e. sentences 
parallel in structure to (26), it has been established (see above, 
earlier in this section) that two derivations — two sets of sequentially 
applied syntactic rules — are possible: first Passive in the lower clause 
followed by Raising in the higher clause but also the "Sneaky" Passive 
derivation with first Raising in the higher clause and then Passive 
applying "sneakily" in the lower clause left intact by Copy Raising. 

It is well-known that the assumption of sequentially-ordered rules 
in syntax has led to the recognition of the need for the cyclic 
application of syntactic rules. That is to say, natural languages 
exhibit syntactic phenomena, well-discussed in the literature,^* which 
require recourse to a device such as cyclic rule application in order to 
be accounted for in a derivational framework. In cyclic rule 
application, syntactic rules apply as a block sentence by sentence from 
the most deeply embedded clause in the phrase marker to the topmost 
(matrix) clause. In a derivational framework, therefore, the cycle has 
been posited as a linguistic universal. 

Similarly, along with the cycle, it turns out that there arc 
phenomena in natural languages which require the imposition of a 
constraint— the Strict Cyclicity constraint — which prevents a rule from 
applying (or reapplying, as the case may be) into an already cycled-on 
domain. With such a constraint, once a higher clause has been reached in 
the cyclic application of rules, a lower clause— an already cycled-on 
domain — would not be a possible domain for a rule. With Strict 
Cyclicity, rules cannot "reach down", so to speak, so as to apply 
entirely within a cyclic domain that has already been passed. As with 
the cycle itself, the Strict Cyclicity condition has been proposed as a 
linguistic universal. 

However, in at least some versions of derivational frameworks, 
there are rules which can be called noncyclic or postcyclic, i.e. they 
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are not "in" the cycle. These rules, Moreover, can apply freely into 
already cycled-on embedded clauses. An example of such a rule would be 
Relativization or Question Movement;* 5 thus a question word which 
originates in an embedded clause nonetheless can be fronted when the 
matrix clause is reached after the cycle, as in (23), where the 0 
indicates the deep structure point of origination of the question word: 

33. Who did Sally think John felt Bill was ready to hit 0? 

One interpretation of this observation is that the principle of Strict 
Cyclicity is valid only for cyclic rules, and does not hold, universally 
it would be posited, for rules not in the cycle (whether demonstrably 
post-cyclic or'si»ply not demonstrably cyclic) rules. .This is an 
interpretation which becomes crucial later on in the discussion. 

Thus, a derivational framework has derivations, it has the cycle, 
and it has a principle of Strict Cyclicity that is restricted to cyclic 
rules. Given these elements of the framework, the change in Greek Copy 
Raising would automatically trigger a change in the status of Sneaky 
Passives. For Ancient Greek, the framework just outlined would allow 
only the derivation of Raising cub Passive sentences such as (26) only 
via the derivation in which Passive applied in the lower clause and then 
Raising applied in the higher clause to raise the now passive agent to 
object status (recall that in Ancient Greek, Raising could operate on 
nonsubject nominals). The other possible derivation— the H Sneaky H 
Passive derivation by which first Raising applied in the higher clause 
and then Passive "snuck" down into the lower clause to apply and put the 
original subject into an agent ive phrase— would be ruled out becuse it 
would violate the principle of Strict Cyclicity (by reaching down into an 
already cycled-on domain). 

Therefore, the change in which nominals were eligible for Raising 
(see (31) and (32) above) would be reflected also in a change in the 
status of Sneaky Passive sentences such as (30), because the only way 
(30) could be derived in this framework is by Raising an oblique 
(agentive) nominal (the other derivation being ruled out by Strict 
Cyclicity). A restriction on what can raise— fro» any nominal to only 
subjects— therefore automatically leads to a situation in which the only 
possible derivation for a sentence is systematically ruled out. Such a 
sentence is thus ungrammatical, for it cannot be derived. 

At this point, from a comparison of the derivational account just 
presented and the nonderivational account preceding it, it would appear 
that the two accounta are equivalent. In both accounts, the change in 
Sneaky Passives falls out as an automatic consequence of the change in 
Copy Raising acting in concert with certain aspects of each framework 
that are either built-in or are universal parts of the theory in question 
(as the cycle is in a derivational framework). 

However, on closer inspection, it turns out that there is a crucial 
difference between the two accounta. In particular, the derivational 
account must make one further, unwarranted and thus unmotivated 
assumption. 
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It cane out earlier in the discussion that under certain 
interpretations of the way in which rules such as Question Movement 
operate, the principle of Strict Cyclic ity would have to be valid only 
for cyclic rules. The consequence of such a restriction of this 
principle, however, is that only as long as Passive is a cyclic rule will 
it be constrained by Strict Cyclicity so as not to apply "sneakily" into 
an already cycled-on domain. That means that the derivational account 
must .take the additional assumption that the rule of Passive stayed as a 
cyclic rule between Ancient Greek and Modern Greek, i.e. that it 
maintained its "cycle-type" and did not become noncyclic. This is a 
result which could be guaranteed by an appropriate typology of cyclic 
rules, such that a rule with properties such as the Passive rule has 
would necessarily be a cyclic rule, but in the absence of such a 
typology, 26 it would constitute an extra assumption necessary in a 
derivational framework. This result would mean further that the 
ungrammatically of Sneaky Passive sentences in Modern Greek really is 
not an automatic consequence of the change in Copy Raising, for Sneaky 
Passives could have remained grammatical if Passive had changed its cycle 
type and become a noncyclic rule (and thus not subject to Strict 
Cyclicity). 

Furthermore, there is some evidence to suggest that rules can 
change their cycle type diachronically and move from being a cyclic rule 
to being noncyclic. In particular, Ref lexivization (at least the 
so-called "Direct Ref lexivization") in Ancient Greek, as best as can be 
determined, was probably a cyclic rule, inasmuch as it interacts with 
apparent EQUI and Raising constructions much as Reflexivization in 
English interacts with the English analogues of those constructions. 
Since that interaction in English has generally been taken as evidence 
for the cyclicity of Reflexivization in English, a similar conclusion can 
be drawn for Ancient Greek Reflexivization. 27 However, in Modern Greek, 
due in part to the change in the morphological make-up of the reflexive 
expression (see the discussion in section 3 regarding (19) and (20)), 
Reflexivization must be taken as a noncyclic rule. The crucial sentences 
that lead to this conclusion are the following: 

34. a. den da afiso ton eafto mu na me katastrepsi 

NEG FUT let/lSG the-self/ACC my PRT me/ACC destroy/3SG 
'I won't let myself destroy myself (literally: "I won't 
let myself that he destroy me") 

b. #den 6a afiso ton eafto mu na katastrepsi ton eafto tu 

his 

c. *den 6a afiso ton eafto mu na katastrepsi ton eafto mu 

my. 

If Reflexivization were cyclic, then it would be expected that (34c), 
with multiple occurrences of ton eafto mu , in both the lower clause and 
the higher clause, would be grammatical. However, (34c) is 
ungrammatical, as is (34b), where there is a reflexive form in the lower 
clause but the possessive that occurs with it is third person (agreeing 
with the person of the reflexive in the higher clause). The only 
acceptable version of such a sentence with Raising and Reflexivization 2 ** 
is that given in (34a), in which there is Raising and Reflexivization in 
the higher clause but no evidence of Reflexivization in the lower 
clause. Such facts run counter to the predictions made by an assumption 
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of cyclicity for Reflexivization in Modern Greek, so it can be concluded 
that the rule is noncyclic in this stage of the language and that 
therefore Reflexivization has changed its cycle-type between Ancient 
Greek and Modern Greek. * 9 

An additional example of a change in cycle- type of a rule has been 
proposed by Haiman 1974. He argues that various phenomena connected with 
the Verb-Second constraint in Germanic languages first entered particular 
grammars as postcyclic processes, subsequently became cyclic, and in some 
cases forced a deep structure reanalysis. For instance, he points to the 
fact that many dummy pronoun insertions (e.g. the it of English 
it-Extraposition, there of there -insertion, etc.) are cyclic in modern 
Germanic languages but appear to be postcyclic in earlier stages of the 
langauges in question. A similar analysis is offered for the Verb-Second 
Constraint itself. 

These two examples, Greek Reflexivization and the Germanic 
Verb-Second constraint, suggest that a change in cycle-type is a possible 
type of change that a rule (or grammatical constraint) can undergo in a 
derivational theory. That being the case, the change in Sneaky Passives 
can only be an automatic consequence of the change in what can undergo 
Raising if it is assumed that Passive stayed a cyclic rule in Greek (or 
if an adequate typology of cyclic rules is developed— see footnote 26); 
since there is no reason, in the absence of a suitable typology, why 
Passive should remain cyclic, it must be concluded that the derivational 
account cannot adequately characterize a relationship between the change 
in Sneaky Passives and the change in Copy Raising. 

Thus the nonderivational account actually provides a better 
explanation of these changes in Greek than the derivational account does, 
for it does not require the additional ad hoc assumption regarding the 
maintenance of cycle-type for Passive. Consequently, the nonderivational 
account is to be preferred. These differences are summarized in (35): 

35. In order for the change in Sneaky Passive to be automatically 



accounted for: 

a. a derivational account needs: 

i. the cycle and a principle of Strict Cyclicity valid 
for cyclic rules (this is given by the theory) 
ii. the change in the Raising rule (see (31) and (32)) 
iii. the assumption that Passive maintains its Cycle-type 
and is therefore subject to Strict Cyclicity 

b. a nonderivational account needs: 

i. a notion of levels to which syntactic rules can make 
reference (this is given by the theory— in the 
version of nonderivational grammar assumed here) 
ii. the change in the Raising rule (see (31) and (32)). 



The nonderivational account has no derivations and therefore no 
cycle; the problem of change in cycle-type is, for such a framework, 
really only a pseudo-problem, for it is one that is forced only by the 
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ground rules of the derivational framework. Thus this syntactic change 
in Greek makes it clear how a derivational framework is burdened by all 
sorts of extra devices and machinery, such as the cycle, as well as 
problems, such as a change in cycle- type, that are, in a real sense, 
nothing more than artifacts of these extra devices. 



5. C onclus ion 

By way of conclusion, a summary of the results developed here can 
be given, as well as a recognition of some remaining problems. 

First, it should be clear that automatic syntactic changes do 
exist, even though some of the previous attempts at uncovering such 
changes are prohably not as conclusive as they might appear at first to 
be. The examples involving changes in the Greek Raising construction 
seems to be fairly good cases of automatic syntactic change. 

Second, it :s also clear that different theoretical frameworks 
handle the same syntactic change in different ways, demonstrating that 
the view that emerges of what changes there are in a language is to a 
large extent colored by one's view of how synchronic grammars are to be 
characterized. It is possible, moreover, to compare the accounts that 
are provided in different frameworks. Light foot (1979a) has claimed that 
diachronic syntax provides M a new style of argumentation for choosing 
between competing theories and synchronic descriptions, hy requiring that 
the theory interact with a theory of change to account for the point at 
which grammars undergo reanalyses or 'catastrophic' changes". The 
suggestion here is that the ability to relate simultaneous changes as 
being automatic changes is another criterion for deciding between 
competing theories to which diachronic syntax can contribute. 

Third, given such a criterion for deciding between competing 
theories, the evidence from the change in the status of Sneaky Passive 
sentences in Greek shows that a nonderivational approach to syntax is to 
be preferred over a derivational approach, for the latter provides a 
simpler and less ad hoc account of the Sneaky Passive change and of the 
means for connecting the Sneaky Passive change with the Copy Raising 
change in a manner that is both natural and automatic. 

Finally, it is only fair to mention what seems to be the only real 
problem with the discussion of these changes in Greek syntax, namely the 
lack of conclusive historical evidence. There are clear indications 
about the status of S: relevant constructions in the first stage, 
Ancient Greek (i.e., in this context, Classical and New Testament Greek), 
and about the lft?t stage, Modern Greek, but only meager indications about 
the intervening stages, especially in the case of the Sneaky Passive 
construction. Unfortunately, there is nothing that can be done about the 
lack of data; 30 it is simply a fact of life in historical studies that 
crucial data can oe missing. One might simply dismiss these otherwise 
very interesting exer-"* * p or that reason or else take them at face value 
and try to grapple with — i. The latter approach has been adopted here, 
for the former seems counter- productive in that it limits the extent to 
which an understanding of syntactic change can be increased. 
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Notes 



*This paper was read as an invited lecture to the Indiana 
University Linguistics Club in March of 1980. What is published here is 
essentially that version, with a few relatively minor changes in 
organization and diction, as well as updating of references, providing of 
footnotes, etc. 

1. There are of course other nodes of explanation that one night 
explore, such as a functional or social notivation for the occurrence of 
one change or another. It is ny belief that no one type of explanatioi , 
whether fornal (such as is pursued here), functional, social, or 
whatever, precludes the investigation of causation fron one of the other 
perspectives, i.e. there is not only a fornal side to language (how it is 
represented in a speaker's individual conpetence), but also a functional 
side (the uses to which a speaker puts this competence), a social side, 
etc. 

2. This assumption is justified by the view in which diachrony is 
seen as the transition of a language through a succession of synchronic 
stages. Linguistic theory provides the constraints on possible 
synchronic systems, so that the movement through time is always the 
movement of one possible system into another possible system. Therefore, 
a theory of synchrony must be able to account for (at least certain 
aspects of) diachrony as well. While it may be true that not all 
linguists would adhere to such a view, it is not an idiosyncratic one; 
Culicover (1984: 118), for example, labels such an assumption 
"reasonable". 

3. Although tbe issue of whether grammars are derivational or not 
is less an issue now than when this paper was first written (1980), it is 
important to note that some still-current theories are derivational in 
principle, including Government and Binding Theory (despite the 
considerable limitation on the number of rules that apply to convert deep 
structures into surface structures), while others, including Generalized 
Phrase Structure Grammar, are distinctly nonderivational. 

4. See, for example, Postal 1986, and the papers in Perlmutter 
1983 and in Perlmutter and Rosen 1384. 

5. To my ear, h as stridden does not sound all that bad. As 
Jeffrey Huntsman of Indiana University has kindly pointed out to me, the 
form stridden does occur in a variety of texts and there is also the form 
bistridan in Cld and Middle English; moreover, the oddity of a perfect 
tense formation with stride may be more a function of the relative 
infrequency of the base verb itself, and not a peculiarity associated 
with the past p u , '.iciple. 

6. I am grateful once more to Jeffrey Huntsman of Indiana 
University for reminding me of this important fact. 

7. The debate perhaps should not be closed on this point, however, 
for Russom (1982) has shown that Lienor's examples of syntactic passives 
in Old English have been misanalyzed. I leave this matter to future 
research, inasmuch as my intent here was only to provide an example that 
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has been given in the extant literature of a change that would be 
simultaneous and automatic in the sense developed hero. 

8. The literature on word-order change and the relevance of 
universale is enormous, and no attempt can be made hero to even briefly 
survey the field; the reader is referred to Hawkins (1979, 1980, 1982a, 
1982b, 1983), Smith 1981, Coopmans 1984, and Payne 1985 for discussion. 

9. The exact analysis of these sentences is irrelevant to the 
point being developed here. In particular, in some theoretical 
frameworks, e.g. Government & Binding Theory, there is no "raising" at 
all in (6a) but rather WH-Movement, and in nonderivational frameworks, 
there are no raising or deletion processes at all. Moreover, in some 
accounts, both sentence-patterns in (6)— despite some differences they 
exhibit, for example, in the possibUity of extraposition— are treated as 
involving the same type c * syntactic structure. Such considerations are 
irrelevant to the matte *t hand because in any theoretical framework, 
sentences such as thos. (S) and the construction-type(s) they 
represent must be accoui * for in some way, and the point regarding 
automatic changes in the . ek equivalents of these patterns can thus be 
translated into the appropriate theoretical analogue. 

10. For details on the loss of the infinitive in Greek, see now 
Joseph 1983 and references therein. 

11. There is of course e.lso a missing subject argument as well 
with the infinitive, a fact which is less significant for the changes to 
be described here than tri missing object. 

12. For further examples and discussion, see Joseph (1978, 1980, 

1983). 

13. It is important to point out that the complement object was 
obligatorily missing in these constructions from Ancient Greek up through 
early Medieval Greek. Examples of these constructions from Medieval 
Greek are to be found in Joseph (1978, 1980, 1983). 

14. I purposely state the constraint in this form, mentioning 
specific constructions, and not in some more general manner (e.g. 
utilising the various versions of the Tensed-S Condition of Chomsky 1973) 
in part because it is irrelevant just how generally this constraint holds 
in the grammar, as long as it is valid for the constructions under 
consideration, and in part because I feel that there is sufficient reason 
not to believe that the Tensed-S condition is a valid condition (cf., for 
example, the possibility of WH-Movement out of tensed clauses — though 
such constructions admittedly have a different analysis in Chomsky's 
framework) . 

15. Hawkins (op. cit.), though, does attempt to address this 
problem through the use of implications involving more than two elements, 

which he claims leads to better statistical validity (e.g. SOV > (ADJ 

f N — > GEN f N)). 

16. This point has been made inost recently by Payne 1985, though 
see also Coopmans 1984. 
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17. The exact analysis of this construction has been the subject 
of considerable debate among generative grammarians of Modern Greek; see, 

mLS* 8 ?^!:; J08eph (1976, 1978: cha P ter 1980), Philippaki-Warburton 
(1979, 1986), Kakouriotis 1979, Ingria 1981, inter alia. The facts under 
consideration remain the sane, and the debate has been spurred in part by 
changes that have been made in the assumptions that underlie the 
theoretical frameworks adopted by different linguists. 

18. See the references in footnote 17, especially Joseph (1976), 
for details. Note that even in those frameworks (e.g. that of Ingria 
1981 or Philippaki-Warburton 1986) in which no raising is assumed for 
this construction, there is obligatorily a "linkage" between the matrix 
accusative nominal and the complement clause subject position. Thus, the 
account to be given here of changes in this construction, an account 
which, given the framework adopted here, depends on the assumption of 
Copy-Raising, could be translated into a framework with no raising 
through the use of this obligatory linkage. 

19. Admittedly, the second person plural form in (17), heautous, 
differs from the Ancient Greek form, and shows generalization of the 
older third person plural form; however, heautous in (17) can be assumed 
to be person marked since person was still paradigmatic in the 
Hellenistic reflexive, with overt person distinctions found in the 
singular, for example. 

20. For example, the reflexive can be modified by an adjective in 
the manner that nouna are modified, with the adjective between the 
definite article and the nominal, but not in the manner that pronouns are 
modified, with the definite article and adjective after the nominal; for 
complete details and furth' relevant arguments, see Joseph & 
Philippaki-Warburton (19^ sections 1.6 and 2.1.2.2). 

21. Optionally, the reflexive form itself can occur as a plural if 
the antecedent is a plural noun phrase, as in : 



In such a case, the reflexive form shows agreement in number with the 
antecedent, but this agreement does not encode person on the reflexive 
form nor is it obligatory. 

22. For example, in Modern Greek written documents (books, 
magazines, newspapers, etc.)— a much larger "corpus" than is available 
for Medieval Greek, especially Medieival Greek of the 10th to 14th 
centuries, i.e. near when the Reflexive change occurred— I have found 
only two examples of Raising cum Reflexive sentences. Native speaker 
judgments of the form that such sentences must take, however, are 
remarkably uniform, all agreeing that if raising occurs and if the 
conditions for the appearance of the reflexive form are met, the 
complement clause must have third person agreement on the verb. These 
facts have been discussed briefly in Joseph (1978: Chapter 10; 1979), 
and in some detail in Joseph & Perlmutter (1979). 



i. 



emis kitazume tus eaftus mas 
we/NOM watch/IPL the-selves/ACC.PL our 
*We are looking at ourselves'. 
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23. There is some controversy on this point. Ingria 1981 has 
drawn attention to some sentences which appear to involve raising of 
nonsubjects. My informants in general were most unhappy with such 
sentences, hence ray conclusion that Raising is restricted to subjects. 
Moreover, the only sentences with apparent nonsubject raising that my 
informants accepted were those involving perception verbs, a class which 
has proven notoriously resistant to analysis in many languages. Ingria 
himself, furthermore, argues that the structures in question are not 
Raising structures, a conclusion I support for the nonsubject instances 
but not for those involving suhjects. See footnote 17 for references on 
some of the controversy surrounding this construction. 

24. See, for example, the discussion in Perlmutter & Soames 1979. 

25. I am adopting here an analysis of these rules in which they do 
not apply in "successive cyclic'* fashion. 

26. One possibility is that rules that change grammatical 
relations are necessarily cycle (a principle to this effect was propsoed 
by Perlmutter and Postal in 1974 * ^tures). However, there are rules in 
English that seem to have the effec of changing grammatical relations 
that do not however "feed" clearly cyclic rules such as (English) Passive 
and thus seem not to be in the cycle. An example is the Benefactivc — > 
Direct Object rule which produces sentences such as I baked Mary a cake ; 
for many speakers, Mary, cannot be passivized (i.e. *Mary was bak ed a ca ke_ 
(by me) ), a fact which would be accounted for if Benefactive — > Direct 
Ohject were a noncyclic rule. Such an analysis would mean giving up a 
typology of rule cycle-type based on effect on grammatical relations. 

27. I realize that actually proving that Ancient Greek 

Reflex ivizat ion is cyclic is an impossible task, given the unavailability 
of all of the relevant data that could be brought to bear on the matter; 
what is presented here, then, is at best a plausible account that is 
consistent with known facts. 

28. These sentences nre much more acceptahle if neither Raising 
nor Reflexivization occur, inasmuch as Raisins is somewhat limited in 
Modern Greek and other means of expressing reflexivity (e.g. through the 
use of mediopassive verhal morphology) are not only available but 
generally preferred. Nonetheless, there is no doubt that these rules are 
part of the grammar of Modern Greek, so that their interaction and 
application in the same "derivation" is possihle in principle. Moreover, 
although somewhat unusual in nature (note the awkwardness of the English 
translation), all Greek speakers that I have consulted with on this and 
related sentences — some 20 in all from various parts of Greece and from a 
variety of backgrounds— have uniform judgments. 

29. This result and related conclusions on the empirical content 
of the cycle as a syntactic construct have been discussed in Joseph and 
Perlmutter 1979; T hope to make the results more readily available in a 
forthcoming monographic study of Raising in Greek. 

30. I have carried out about as thorough a search as possible 
through the vernacular texts of Medieval Greek, covering literally 
thousands of pages of documents; see Joseph (1978: Chapter 1) for a 
discussion of the methodology and references regarding the texts examined. 
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Loss of Nominal Case Endings in the Modern Arabic Sedentary Dialects* 



1.0. Introduction 

Classical Arabic (CA), which is widely accepted as representing pre- 
Islamic Arabic speech and therefore the ancestor language of the present-day 
Arabic dialects (cf. Birkeland 1952; Blau 1961, 1965, 1966-67; Ferguson 1959; 
Fiick 1955), had nominal case endings, while the modern sedentary (non-Bedouin 
—generally, urban) dialects do not have these endings. Since the modern 
sedentary dialects differ from each other in a number of ways— even to the 
extent that a number of them are not mutually intelligible — the question 
arise? as to how all these dialects came to have in common the lack of 
nominal case endings. This paper examines some of the evidence that has 
been brought to bear on this question, proposes some different analyses, and 
evaluates several of the existing theories in light of the new analyses. 

Unless otherwise noted, the transcription used here is phonemic and 
uses symbols of the International Phonetic Alphabet. A dot under the 
consonants t, d, s, and z (t, cl, s, z) indicates pharyngeal izat ion. A dot 
under the consonant h (h) indicates a voiceless pharyngeal fricative. 

2.0. The Classical Situation (Nominal Case Endings) and the Modern 
Situation (Reanalyzed Remnants) 

The CA nominal case endings and their modern reflexes are shown in 
Table 1 below. In addition, the markers for feminine gender (^at 'FEM SQ* 
and ^at 'FEM PL') and the marker for indef initeness (^n— called 'nunation' 
in English, 'tanwln' in Arabic) are shown. CA words which are definite do 
not include the indefinite zR kut, rather, end with the vowel which marks the 
case ending (^u 'NOM', zl 'GEN', or z* 'ACC') or with the dual or regular 
masculine plural endings. The parentheses around the t in the modern femi- 
nine singular marker ^at indicate that the t is pronounced only in certain 
environments. These environments are those in which the feminine marker is 
followed by a pronoun or a noun which is in a possessive relationship to the 
noun— the latter called 'construct state' in English, ' idafa ' in Arabic. 
In Levantine Arabic, for example, 'university', lit. 'university-FEM SG' 
(/jamf-at/) is pronounced [J*mS-*]; 'his university', lit. 'university- 
FEM SG-his' is {3«uf-It-u]; 'her university', lit. 'university-FEM SG-her' 
is {jimf-It-h*]; 'Yarmouk University', lit. 'University-FEM SG-Yarmouk' is 
{JimS-It yarmukj; and 'The University of Jordan', lit. 'University-FEM SG 
DBF- Jordan' is n*m9-It Il-?urdun). Note that the t in the feminine marker 
for the modem dual (where pronounced) and the modern regular plural is 
always pronounced since it is followed by a suffix (zla) which is closely 
connected to it. 
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Table 1: Classical Arabic Case Endimte and their Modern Renants 





CLASSICAL ARABIC 


MODERN ARABIC 




Gender 


Case 


Indefinite- 


Gender 


Case Indefinite- 


SINGULAR 


(F) 


(M&F) 


ness (M&F) 


(F) 


(M&F) ness 


Nominative 


-at 


-u 


-n 




Genitive 


-at 


-i 


-ii > > 


-a(t) 




Accusative 


-at 


-a 


-n 






DUAL 


(F) 


(M&F) 


(M&F) 


(F) 


(M&F) (M&F) 


Nominative 


-at 


-ani 


i> 


-at/rf 


-ayn/fi 


Genitive 


-at 


-ayni 


* } > 


(Most dialects have ft, and 


Accusative 


-at 


-ayni 


i> 


plural has replaced dual) 


REGULAR PLURAL (F) 


(M)/(F) (F) 


(F) 


(M)/(F) (F) 


Nominative 


-at 


-una/- 


•u -n 




Genitive 


-at 


-ina/- 


i -n } > 


-at 




Accusative 


-at 


-ina/- 


•i -n 






IRREGULAR (BROKBN) PLURAL = STEM CHANGING 










(M&F) 


(M&F) 




(M&F) (M&F) 


Nominative 


i 


-u 


-n 






Genitive 




-i 


-n } > 






Accusative 




-a 


-n 







As Table 1 shows, nouns in the modern dialects have generally undergone 
four changes from CA in phonology and morphology: 

(1, 2) Phonology: unconditioned loss of indefinite and ^V, and 

conditioned loss of feminine singular ^t. 
(3, 4) Morphology: merger of the nominative marker with the genitive/ 

accusative marker in the regular plural and — in those dialects 

that retain it — the dual. 

These changes have resulted in the sedentary dialects losing their nominal 
case distinctions. 

It is unclear, however, how all these dialects have come to have this 
same change in common since the dialects are spread out over a vast area. 
Several theories have been advanced which specifically account for this 
phenomenon by postulating phonological and morphological changes that led to 
it. Prominent among them are those of Birkeland (1952), Cantineau (1953), 
and Blau (1961, 1965, 1966-67), all outlined below. 

2.1. Birke land's Theory 

Harris Birkeland (1952), drawing on the observation that Classical 
Arabic had pausal (citation) forms which were essentially like the modern 
forms (except for the nominative/oblique merger), took these forms as the 
origin of the modem dialectal forms. That is, in CA -(t)V(n)* in singular 
and broken plural forms in context (non-pause) position hecame in pause 
position (in isolation and sentence finally) in the nominative and genitive, 
and it became ^§ in the accusative. Furthermore, some Old Arabic dialects 
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had pausal forms which ended in j4 for all the cases. Birkeland proposed that 
these reduced pausal forms of these old dialects were then generalized to 
context position in a later stage of the dialects so that forms representing 
more categories replaced forms representing fewer categories: the earlier 
system with one form representing each of the nominative, genitive, and 
accusative cases gave way to a system with one form — f$ — representing all 
three cases. Birkeland stated that this conclusion is the only one possible 
because: 

(1) we know that CA and some old dialects had both context forms and 

pausal forms; 

(2) the modem sedentary dialects have only pausal forms, with context 

forms as relics in places that could not have pausal forms (the 
construct state, or idafa ) ; 

(3) therefore, the form that survived had to have replaced the lost 

form. 

Even though this conclusion is not explicit as to how the replacement 
happened, it ia a plausible explanation of the changes in nouns that took 
place between CA and the modern dialects. 

2.2. Cantineau's Theory 

Jean Cantineau (1953) proposed that the loss of case endings was 
brought about by a phonetic sound change which dropped short final vowels, 
plus a morphological rebuilding of the case system, in the following steps. 

(1) Short vowels (especially u and i) were weakened and so were 

subject to loss in open syllables. Therefore, first the nomina- 
tive marker ^u became £*» and then the genitive marker became 
ff. After these changes, only the accusative marker ^a remained. 

(2) The case system underwent morphological rebuilding to lose the 

nominative and genitive distinctions in indefinite nouns, too 
(by analogy to definite nouns): nun became fj, and z±R became f*. 

(3) A phonetic sound change made context z& a "d pause (<r9S) 

become f*. After this change, context -an was the only case 
ending left. 

(4) Then in context became f$ due to morphological rebuilding (by 

analogy to the other foiins which had j4 endings already). 

2.3. Blnu's Theory 

Joshua Blau (1961, 1965, 1966-67) maintained that the modern Arabic dia- 
lects grew out of Middle Arabic dialects which diverged from CA as CA spread 
outside the Arabian Peninsula during the Islamic conquests (c.a. 632-800 
A.D.). These new dialects differed from each other because they developed 
in different towns, but they all lost case (and mood) endings due to (1) the 
influence of the foreign languages which did not have case endings. (2) the 
stress changing from weakly centralizing to strongly centralizing, * and (3) 
the generalization of pausal forms to context position. He argued that 
these changes occurred in the following steps. 

(1) Short vowels in open syllables (especially word finally) were 
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weakened and therefore tended to drop, u and i dropped first 
because they were weaker than a. This resulted in. nominative 
and genitive definite singular nouns, feminine sound (regular) 
plurals, and hroken (irregular) plurals losing ^u and ^i. 

(2) Nominative and genitive pause forms were extended to context, so 

that -un and -in hecame fi, 

(3) Word final long vowels became ahort so that pausal accusative 

from context -an became ^a. 

(4) a was weakened and then dropped in open (especially final) sylla- 

bles, so accusative became ji. At this stage, -an in. context 
was the only vestige of the former case markers left, no longer 
signifying case since the system had broken down so much. 

(5) Accusative pausal forms (with jtf ending) were optionally extended 

to context, so that no final case markers were left except 
optionally. 

(6) The oblique case markers of the dual ( -ayn ) and the masculine 

sound (regular) plural (-in) replaced the nominative markers 
( -an and -un . respectively) , since there was no longer a 
need to distinguish cases. 

3.0. Evidence Which Illuminates These Theories 

All of these theories deal with plausible types of changes, and so, 
since they are not mutually exclusive, it ia possible that any or — as Blau 
argues — all of the factors which they propose could have contributed to the 
loss of case endings in the Arabic dialects. The task, then, is to find 
evidence that sheds light on what probably occurred, so that the amount of 
speculation necessary about what possibly occurred can be minimized. 

There is a body of documents available which provides such evidence and 
which scholars in general — including those mentioned above (except Blau) — 
had not conaidered when developing their theories. ^ These are the writings 
of non-Arabs during the first five or so centuries of Arab rule (approxi- 
mately the 8th through the 12th centuries A.D.). Blau (1961, 1965, 1966-67), 
who has analyzed hundreds of these writings, maintains that they provide 
infoimation about characteristics of colloquial Arabic immediately following 
the Islamic conquests. As such, they are the oldest documents available 
which reveal the colloquial speech after the conquests provided the oppor- 
tunity for extensive changes in Arabic to take place, due to the interming- 
ling of Arabs from different areas in military campaigns and settlements and 
to the learning of Arabic by the conquered non-Arabs. They thus reveal a 
stage of Arabic which is intermediate between Classical Arabic, which had 
case endings, and the later stage of dialectal Arabic which does not have 
case endings (Blau's 'Modern Arabic* ). Blau termed this intermediate stage 
'Middle Arabic (MA). 1 As an intermediate stage, MA provides information 
about sense of the steps the language went through as it changed from the CA 
type to the modern dialectal type. 

These texts are written in CA, which was the standard written language, 
and the characteristics of MA are revealed in them as deviations from CA. 
Blau points out that there are no known texts written in colloquial MA, so 
the best that can be done to ascertain the traits of MA is to analyze texts 
of CA which contain deviations. He states that these texts are very 
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revealing sources of MA since they contain numerous deviations. He argues 
that such deviations represent either intrusions from the spoken language or 
hyper- or hypo -correct ions since the writers were generally trying to write 
in the prestigious standard language (CA). 

The deviations appear almost exclusively in manuscripts written l>y Jews 
and Christians who wrote (usually copied) mainly religious texts in their new 
language— Arabic. Blau notes that while a few colloquialisms occui in offi- 
cial Muslim papyri of this time, they do not occur often because, as the 
language of their religion, CA was an extremely high ideal for Arabs. Conse- 
quently, Arabs were very careful not to let many colloquialisms enter their 
writing, while non-Arabs were cither not as careful or not as able since CA 
was not such a high ideal or as familiar for them. Even so, Blau points out 
that the few colloquialisms which occur in Arab papyri and poetry at the 
beginning of the 8th century A.D. have the same basic characteristics as 
those which occur in non-Arab texts. Therefore it can be assumed that MA 
was in use as early as this and that the Arabic spoken by Arabs at this time 
had the same basic characteristics as that spoken by non- Arabs and revealed 
in their writings. 3 

According to Blau (1961, 1966-67), among the non-Arabs, the texts which 
reveal the most about the spoken language of this tine are those written by 
Christians in Southern Palestine for other Christians because there are many 
more texts available from this area than from the other areas which produced 
such texts. Furthermore, these writings include the earliest dated documents 
which include numerous examples of MA and numerous manuscripts which were 
wi itten in the monasteries there in the second half of the 9th and the 10th 
centuries. They also include a number of undated manuscripts with niunerous 
examples of MA for which there is evidence that they were wntten there in 
the 8th century — seme as early as the beginning of the century. Most of 
these are translations from Greek and Syiiac, but some are originals in 
Arabic, showing that the native non-Arabs did, indeed, produce this type of 
writing. As Blau points out, the dialect chaiactei istics revealed in these 
documents are not homogeneous with the characteristics revealed in documents 
from othei areas, othei religions, oi other times. However, his studies have 
shown that the basic features of all these different dialects are the same, 
and so Southern Palestinian Chi istian Arabic: — oi Arabic of Southern Pales- 
tine, abbreviated ASP by Blau 1966-67— can reasonably be used to represent 
MA as a whole, while also noting the deviations in the documents which repre- 
sent only ASP or only the particular copyist. Blau (1966-67) does just this, 
and so the present study looks at the ASP deviations which Blau indicates are 
also common to other MA dialects. 

Blau (1961, 1966-67) notes that precautions must be taken when analysing 
MA texts because some of the deviations from CA do not represent the spoken 
Arabic of the time. For example, a number of the deviations are pseudo- 
corrections, which are a mixture of standard and colloquial features, result 
ing from the writers trying to use CA but not always applying its rules cor- 
rectly. Types of pseudo-corrections which are found in the texts include 
inolapropisms (such as writing lasiyyam a for la siy yama 'especially'; 
Blau 1966-67: 50), use of CA forms where they are not appropriate (called 
'hyper • oi 'over correction*- such as use of the prestigious nominative 
case where the less prestigious oblique case is appropriate; Blau 1966-67: 
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51), and mixtures of MA forms with CA forms (called 'hypo-' or 'half- 
correction' —such as use of a dual verb before a dual subject, when CA used 
a singular verb before a duol subject, and MA used a plural verb heforr a 
dual subject; Blau 1966-67: 51). Blau notes that the ASP texts also show 
influences from the other language spoken in the area— Aramaic— as well as 
loan translations from the languages that many of the texts were originally 
written in— Greek and Syriae. The texts also show influences from CA spell- 
ing (such as usually spelling words which had CA or £ with their respective 
CA letters even though these sounds had probably merged in ASP; Blau 1966-67: 
56, 113-114) and from traditional literary features which had disappeared 
from the spoken language (such as following an imperfect verb which ended in 
a long vowel with the symbol for ^n when the dialectal pronunciation no 
longer included the ^ Blau 1966-67: 57). Therefore, in order to identify 
the true MA features from these texts and weed out the pseudo-corrections 
and other deviations from CA which did not lepresent influences from collo- 
quial Arabic, Blau (1965,. 1966-67) listed in his studies of Judaeo-Arabic 
and ASP texts only those features which occurred in a number of the texts as 
reliable features of MA, because they recurred. The present study relies 
only on these recurrent MA features of ASP which Blau compiled. 

4.0. What These Texts Show about thfi Loss of Nominal Ca s e Endings 

Blau's (1966-67) compilation of a grammar of Christian Arabic based on 
his analysis of numerous grammatical characteristics of the Southern. Pales- 
tinian texts includes a number of conclusions about the historical changes 
that the language underwent to reach this stage of Middle Arabic. A ^analy- 
sis of the data he considered points to some additional conclusions, sone 
different conclusions, and some of the sajfte conclusions, as discussed below. 

4 • 1 • A Stre ss Shift Could Have Occurred 

Blau notes that while long vowels are generally indicated in these 
texts, short vowels generally are not, making it difficult to draw conclu- 
sions about ASP based on the occurrence or nonoccurrence of vowels. How- 
ever, the places where vowels are indicated show that some of the vowels 
(short vowels more than long vowels) were sometimes written with symbols 
which indicated a different vowel quality than the vowels had in CA, and 
that long vowels were often shortened in final open syllables and short 
vowels were often dropped in open unstressed (especially final) syllables. 
The changes in shart vowels are shown mainly by an 8th century fragment of 
Psalm 78 which is written all in Greek letters and includes the original 
Greek text and a translation into Arabic. Since it is written in Greok 
letters, it indicates all the Arabic vowels, including the short vowels— 
which the Arabic script generally does not indicate. It thus provides a 
rare window on the full vocalization of Arabic at this time. 

A reannlysis of the data cited by Blau (1966-67) supports his conclu- 
sions (p. 44) that these general trends occurred. The fact that the data 
bears out his conclusion that 'the quality of the short vowels was rather 
inconstant » supports his subsequent conclusion that the vowels in ASP 'were 
weakened, thus becoming liable to change and elimination. 1 The inconstancy 
of ASP's vowels is shown in the examples that Blau (1966-67: 63-65) cites of 
ASP letters which represent different vowel qualities from CA, listed below 
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in Table 2. 



Table 2; ASP Words with Vowel Qualities that Differ from CA 



Number of ASP Vowe 1 
instanc es (underlined) 



(from Blau 1966-67: 63-65} 

for JCA Vowel 
(underlined) 



Myaning 



(1-3) 

W 
(5) 

(6-7) 

(8) 



(1) 
(2) 



otXtvlitv 

f + + tXlT 



e~a (same text) 



wa-l~?awdiya 'and the streams* 
and-DEF-streams 

la?al 1 'perhaps* 

ya-qdir 'can* 
odASC SG IMPERF-can 

istafal-at 'was kindled* 
kindle/PASSlVE 3FEM SG PERF* 

fa-sal -at 'and it gushed out 
aud-gush out-3FRM SG PERF 



a 

sama 
la-hum 
to- them 



'heaven* 
'to them* 



(3 



(1) 

(2) 
(3) 



u a)* 
inuqadira ina* qadir-a 

noun-can-FEM SG 
yudruh ya-<Jruh 

3MASC SG IMPERF-beat 
yusjr ya-sir 

3MASC SG IMPERF-becoroe 



'ability* 

'he will beat* 

'he will become 1 



(1) 

(2) 



c 



jiddan 'much' 
istaSal-at 'was k'r.dled* 

kindle/PASSlVK-3FEM SG PKRF 



(1) 

(2) 



n 

sulm 
mush 



i 

silm 
misfc 



peace 

'haircloth* 



(1) 



(1) 

(2) 

(3) 



tuyur 



e (written as a 

a in Arabic) 
mamra mamra 
f**iXtr fa-sal -at 

and-gush out-3FEM SG PERK 
AUIXkk Udalik 



'fowels' 



'Mature' 

'and it gushed out 
'therefore* 



♦The parentheses around the listing of ASP u wiitten for CA a indicate 
that these instances may represent morphological, rather than phonetic. 
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substitution.. This is so because all of the examples that Blau cites 
exhibit the substitution in a prefix: one instance of mu- for ma- 
(prefixes for verbal nouns), and two instances of yu- for ya_2; (prefixes 
for imperfect active verbs). Since these prefixes which contain the u 
occur in Arabic— and frequently — it would not be surprising if the 
non-native speakers of Arabic occasionally mixed up the prefixes which 
contained u and a. If ASF u for CA a were a phonological change, one 
would expect to also find it in environments other than those which are 
morphologically defined (here, prefixes). Therefore, it seems that the 
u for a substitutions noted by Blau and listed here should not be 
included in data shovr'ng that vowel quality in ASP was inconstant. 

The data in Table 2 shows that CA a, i, u, and a were subject to phonologi- 
cal change in ASP and that, in general, the change was centralization: a>e, 
i>e, and a>e. Also, nccasionally i and u were interchanged. Centraliza- 
tion could have been a reason for this, too, if the pronunciation of these 
vowels diverged from peripheral toward central so that hearers perceived them 
as falling within the opposite phoneme boundary. All these changes point to 
a situation in which these four vowels varied from their CA pronunciations, 
at least sometimes, enough that ASP hearers (including writers) perceived 
them as different vowels, and then ASP writers wrote them as the different 
vowels. In such a situation, it would not be unusual that fewer of the long 
vowels varied in their pronunciations than the short vowels did (as this data 
shows — only a; not I or G; but a, i, and u) since their longer duration 
would have made them more resistant to centralization, both in production 
and in peiccption. 

Along with this inconstancy of vowel quality, the data listed by Blau 
also indicates that, in contrast to CA, long vowels were shortened in final 
open syllables, and short vowels were deleted in open unstressed sylablea — 
especially word finally. Some of the evidence cited by Blau in support of 
the first claim is that words which end in CA 2.0 are sometimes written 
with .-J* in ASP, and CA ^1 is sometimes written as in ASP. The second 
claim is supported by Blau's report that a symbol indicating the lack of a 
vowel (Arabic sukun, symbolized *) following the consonant it is written 
above is sometime., written in ASP at the ends of words which ended in a 
short vowel in CA, The loss of short vowels in open unstressed syllables in 
ASP is further suppoi ted by Blau's observation that a symbol indicating 
glottal stop followed by a vowel (Arabic ?alif , symbolized I) is sometimes 
added before an jnitial consonant that was followed bv - «»iort vowel in an 
open unstressed syllable in CA. Blau reasons that a vowel was added before 
the initial consonant of the word because the unstressed vowel following 
this consonant had been dropped. The vowel was inserted, apparently, in 
order to break up the consonant cluster which resulted when the unstressed 
vowel was dropped- a phenomenon which is common in Arabic. For example, CA 
SI yhm (|9alayhlmj) was written in ASP as iflyhm ( I ?af layhlm] ) . 

Blau (1969: 221, 1965: 45) states that the changes in the vowels in ASP 
des cubed above blayed an unportnnt role in bringing about the loss of case 
endings and that a factor in bringing about these vowel changes was a change 
in stress. He claims that CA must have hod weakly centralizing stress (see 
Footnote 1> because short vowels were preserved in open unstressed syllables, 
but. that the stress must have shifted tu strongly centralizing in ASP because 





64 



short vowels were then blurred in open unstressed syllables, as described 
above. q While it seems likely that the changer, in vowels icspeeiolly loss 
of final short vowels) contributed giently to the loss of the case endings— 
some of which wore marked solely by particular final short vowels— it is not 
clear whether a change of stress did 01 did not bring about these vowel 
changes. 

The conclusion that n stress shift occui red is consistent with the 
facts, and so it is a possible explanation for them. It is widely accepted 
that vowels which get centralized (i educed) are unstressed and often occur 
in open syllables, especially at the ends uf words. Suite centralization 
weakens the vowels (makes them less peiceptually distinct), such vowels are 
often subsequently lost altogether. Therefore, it would be expected that if 
the stress in Arahic had changed from CA to MA in such a way as to favor 
centralization of vowels more than it had before, then more vowels than 
before would show centralization and possihly total loss in these environ- 
ments. Since this prediction describes the phenomena exhibited for the 
vowels which occur in the ASP texts, the conclusion could be reached for 
this stage of Arabic that a shift in the type of stress had occurred. 

However, such a conclusion is not required by the facts. Vowel cen- 
tralization can occur whenever a syllable is unstressed; it does not need to 
he preceded by n shift in stress. Th* syllables in which the vowels were 
reduced or lost in MA could also have been unstressed in CA but not have 
undergone vowel reduction or loss yet. If this was the case (and there is 
no evidence that it was nut the case), then ASP would simply be the stage at 
which the vowel changes occurred, after the impetus for the changes was set 
up at an earlier stage. Therefore, since such a situation does not requue 
positing that a shift in stiess occurred between CA and MA, the vowel phe- 
nomena do not show that there had necessarily been a shvfL stres«, they 
only show that there could have been a shift in stress. 

So these data show that Cantineau* s and Blau's theories that the loss 
of case endings began with a shift in stress could *>e right but may not bo. 
The fact that they give enough information to show that these theories could 
be correct is a step forwaid from the argumentation supplied by Cantineau 
and Blati, who extrapolated their conclusions from only a few facts. The 
fact that these data show that there is not enough information to confiim 
these theories is nlso a step forward, since Cantineau and Blau both assert 
that a stress shift did occur, implying that the evidence definitely 
supports such a conclusion. 

4.2. Nominal _ Case. Eiidings .May_Npt. Ha ve Been._Lps t_ (Jqit.pl et e 1 y Ye t 

The ASP texts give evidence that the nominal case endings had been lost 
at the ends of words hy this time, supporting Hlau's (19G1, 1965, 1966 67) 
claim that the case endings had completely disappeared by the time of ASP. 
However, contrary to Blau's claim, the ASP texts also fcive evidence that the 
case endings way not yet have disappeared when followed by a pronoun suffix. 
The evidence suppurting this situation of partial preservation of the case 
endings at this time is examined below. 

4.2. 1. Case, Jndings at the Ends of Words 
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Blau (1967: 317-318) reports that, except for one word (U£U) for 
Jiddan 'much', where z* for z*Z marks the accusative case — discussed below 
in Section 4.2.4), the Greek/Arabic fragment of Psalu 78 exhibits no case 
endings at the ends of words. The examples that Blau gives are listed below. 
Here, and in the rest of this paper unless otherwise noted, underlining of a 
blank space in the ASP text indicates the place where a CA letter would have 
occurred. Underlining in the corresponding CA word shows the CA letters 
that are not indicated in the ASP text. 



ASP: for CA: 



(1) o£*£oayJj wa-xubz-a-n . . .ma?id- at-a-n 

tikUtv and-bread-ACC- INDEF . ..table- FEM SG - ACC - INDEF 



'and bread... table' 



(2) Xv)t°V/f-_ luhum-a-n 

meat- ACC - INDEF 

'meat' 



Case endings are not listed in the Greek/Arabic psalm even when the noun 
occurs in the 'construct state' (' idafa construction' in Arabic)— a syntac- 
tic construction made up of a series of nouns which indicate possession of 
by N2 and— if three nouns occur— N2 by N3. In this construction, the nouns 
are very closely tied together and therefore— except the final word— would 
not be pronounced in their pause forms in CA. Blau states that in ASP, how- 
ever, such nouns are writter. in their pause forms, and he gives the following 
example. 



ASP : for CA : 

oa/i t*k-.f«/{\ - . wa-miei-i raml-i-l-buhur 

cXflovJLQVf and-as-flEN sand-fiEN-DEF-sea 

'and as the sand of the sea' 



In this example, the case vowel of CA miOli was not written in the ASP 
text, and Blau says that the case vowel of CA rawli also was not written in 
the ASP. Since Greek which occurs in this text after the Greek for rami 
— is a vowel, though, this could be the i of rainli . This possibility i s not 
likely, since the dots in the Greek rendering of ASP apparently indicate word 
boundaries, but it should be considered and investigated further. In any 
case, this example shows that ASP dropped at least some case endings in this 
construction. Since this position is so resistant to deletion, this is 
strong evidence that ASP had either, as Blau claims, totally lost the oase 
distinctions which formerly occurred at the ends of words (if the Greek t 
was not the i from ramli ). or nearly lost these distinctions (if the t was 
the i from ramli). 

Two other examples which Blau (1967: 320) cites as evidence that the 
cases had been lost in ASP actually show that the form which marked the 
cases in CA did no jlways disappear — sometimes it just ceased to. carry out 
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its former function. In these two examples, lis led below, the noun ?ax 
'brother* , which is ui the construct state, ends in the form of a CA case 
ending which is an incorrect ending for this context: _u in Vaxju 
f 'brother-NOM') signalled the nominative case in CA, but this "construct ion 
required a genitive murker (^1) on this noun. 

ASP: 

m Ui) yfa)9qGh ?{a)x-u r(u)b-isa 
to James brother* NOM lord-out* 

'to James, the brother of our lord' 



(2) r(i)sal-a y(a)9qub ?(a)x~u r(u)b-no 

epist le -FEM.SG Jnwes brother^ NOM lord- our 

'the epistle of James, the brother of our lord' 



Therefore, these endings, while retained in form at the end of the word, 
appear to no longer be functional as case markers. This situation i» dis- 
cussed in detail in the next section. 



4.2.2. Case Endings Before Pronoun Suffixe s 

Blau (1967: 318 n.3) points out that sometimes the CA case endings were 
omitted in ASP before attached piononn suffixes, and he claims that these 
instances represent the actual ASP usage. The two examples he cites of this 
type of omission, from the Greek/Arabic Psalm 78, are: 



ASP ; for CA : 

(1) pt.v.otv- .{y/ sahw-at -a-hum 

desire -FEM SG-ACC their (MASC) 

'their desire' 

(2) Kc«f.er_.ov quds-i-hl 

sanctuary-GEN-hi s 

"of his sanctuary* (no overt preposition) 

He notes that at other times the case endings occurred in this position in 
ASP, and he claims that these instances were not tim genet al usage but were due to 
the influence of CA^ that is, that they were hypei corrections. The two examples 
he gives of this are the following, with the case en.dings underlined. 



ASP: for CA: 

(1) ^CJotu.oo(7.c v/i bi«?awGan-i-hiifl 

with-idoJs~GRN their (MASC) 

'with their idols* 
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(2) ^J./fCY.)£OV»re. 



hi-manxut-at-i-him 

with- graven Snage-FEM PL-GEN-their (MASC) 



'with their graven images' 



If these occurrences of case endings are hyper-corrections, it is 
curious that they occur only before a pronoun suffix and never at the end of 
a word. If the writer was correcting his Arabic according to the rules of 
CA, it would be expected that he would have at least occasionally written 
case endings in the most obvious place t^iey occur in CA — at the ends of 
words. Since he did not do this, it raises the question of whether the 
instances of case endings before the pronoun suffixes are, indeed, instances 
of hyper-correction. 

The alternative is that these rase endings represent the actual usage 
of the time. Perhaps case ending vowels had not been totally lost yet but 
were still pronounced — at least sometimes — when they were not at the end of 
a word. This is plausible, since such vowels would have been protected by 
the suffixes which attached after them, so they would have heeu less suscep- 
tible to changes that affected the ends of words than vowels which came at 
the ends of words would have been. 

Additional evidence in support of this hypothesis comes from the 
examples Bleu (1967: 318 321) gives of vowels that represent cases in CA 
which occur in ASP texts after ?ab 'father* and ?ax 'brother. 1 Blau cites 
124 instances where this happens in a number of manuscripts. Of the 12 
examples that he writes out fully (which include 26 instances of ?ab or ?ax) . 
all hut two (those listed above) have pronoun suffixes attached after the 
vowels. This is a substantial increase over the examples noted by Blau in 
both the number of examples and the number of manuscripts in which these 
vowels were written in ASP. Therefore, these examples make it look more 
plausible titan Blau indicates that at this time what had been case vowels in 
CA continued to be pronounced when they occurred 'before pronoun suffixes (as 
well as sometimes without the suffixes). 

This evidence is not unquestionably supportive of the theory just 
advanced, and the theory is not without qualifications. But the possibility 
that CA case vowels were pronounced in ASP at least sometimes (possibly 
mainly before pronoun suffixes) is one of several scenarios that would 
explain all this evidence without leaving problematic exceptions that need 
to be explained as influence from CA in only limited environments — as Blau's 
theory does. All these explanatory theories deserve to be considered, and 
so the qualifications of the above theory as well as the rest of the theories 
are discussed below. 

One of the i>rohlo)as that must he accounted for is that, as noted above, 
the case vowels which occur in the ASP texts described here are often the 
wrong vowels for the :ases that should occur in these positions. Of the 124 
examples involving ?ab and ?ax which Blau cites, almost all are examples in 
which the wrong case vowel (often u) was used. The seven which are exam- 
ples of the correct vowel being used are from manuscripts that Blau says are 
grammatically corrected (even though they also exhibit deviations from CA in 
the case vowels;, and so he discounts them. Only the two examples first 
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mentioned in this section from the Greek/ Arab it psalm and one instance of 
Tab in an example that includes three instances of Tab --have the correct CA 
vowel for the case ending without the possibility of it having been corrected 
after the original writing. The theory that the case vowels were sometimes 
pronounced in ASP must therefore be qualified to account foi many of these 
vowels being wrong. Three possibilities exist to account for these vowels 
in the thnory just proposed. 

One possibility is that a vowel was often pronounced in the case marking 
position but that the particular vowel always or often varied randomly so 
that sometimes it matched the CA vowol used to mark the? particular case, and 
sometimes it did not. Such a situation would have occurred if ASP had a 
ruin to insert a vowel — but, for many people at least, not any particular 
vowel but often u — after a noun in particular environments (mainly before 
a pronoun suffix). If this was a rule that not everyone used or that was 
violated occasionally, the few exceptions in the examples examined here in 
which a vowel was not inserted between a noun and a pronoun suffix would be 
explained. If this rule was sometimes extended to nouns in the construct 
state, the two examples examined here of a wrong ease vowel being used in 
the construct state would be explained. In such a situation, the system of 
case marking would have either broken down entirely already or would have 
been in the process of breaking down, depending on whether some speakers 
still had a sense of case marking- even possibly including some rules to 
place the correct CA vowel in the correct position at particular times. 

The second possibility is that the case vowels were pronounced only in 
the environments exhibited here. Thai is, the words Tab 'father', Tax 
'brother', VawGan 'idols', and manxut at 'graven images'-- or religious 
words in general- might have been preseivcd longer in their older foims (fol 
lowed by case vowels) than other nouns in ASP were, because of having a spe- 
cial status as religious vocabulary and because of being repeated often in 
their old CA forms in religious contexls, or as the result of loss by diffu- 
sion. The forms followed by pronoun suffixes could have been preseivcd 
longer than the forms without these suffixes, due to protection by the suf- 
fixes. In thia situation, ASP ui sonic spedkeis of ASP could have had rules 
such as those described for the first possibility above, except that the 
environments would have been specified eithci for these particular words or, 
more generally, for religious words. Tins would be a situation in which the 
system of case marking had broken down even more than it would have for the 
first scenario described above, but- contrary to Rlau's theory- some sense 
of it would still have existed. 

A third scenario if. also possible that the case vowels represented by 
the Arabic ASP writings are indicative of only the writing system and not of 
the spoken language of ASP. In this situation, the vowels concerned are the 
long vowels attached after Tab and Tax, since these are the only case vowels 
that occur in these examples in writings done in the Arabic script. In this 
situation, if the vowels following Tab and Tax are only pai t of the Arabic 
wutmg system foi ASP, then they likely represent an earlier pronunciation, 
and so they still have something to reveal about the history of spoken Arabic 
case endings. For the same reasons at; given above, whenever the spellings 
of the case vowels began to not follow the CA lulcs, they were probably fol- 
lowing either current pronunciations 01 recent pronunciations (.exhibited by 
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a sense that a vowel should be attached after a noun in particular environ- 
ments). So if the case vowels had ceased to be pronounced entirely hy the 
time of ASP, the ASP spellings show that these vowols hail formerly been pro- 
nounced for a longer time bofore pronoun suffixes- -either in religious words 
or generally— than in other environments. In this situation, then, the ASP 
spellings just continue an earliex writing tradition that placed random long 
vowels (often u) in the environments in question. 

There is othei evidence, though, that there war, some awareness of case 
endings at the time of ASP. This is provided by two examples that Blau 
(1966-67: 318 n.3) mentions from the Greek/Arabic psalm. In these, no case 
endings are present, but the vowel of the pronoun suffix has been changed to 
agree with what the vowel of the genitive case ending would have been if it 
had been there, as was done in CA when the genitive ending was present. 
That is, ^hu *he* has become ^hT in vowel harmony with the preceding 
(here, missing) genitive marker zlz* a » shown by the underlinings in the 
reproductions of Blau's examples below. 

ASP : for CA : 

(1) XitpftiX.fi.vi li-sa?h»i-hT 

for«people *GEN-his 

*for his people' 

(2) YoiAci )L*\*&- ll fala xalas-i-hi 

in sal vation-GEN- his 

*in his salvation' 

Blau tonus this phenomenon Remarkable 1 and attributes it to hypo- correction 
-•a mixture of ASP and CA. 



But this does not have to be seen os a remarkable nnd uncxplainahle phe- 
nomenon except by appeal to the influence of CA. These words could, in fact, 
show the real ASP usage— that the pronoun vowel was pronounced in these words 
so as to represent the genitive case ending in some way. There are three 
possibilities for the way this could have been done. 

First, the -i of the pronoun could have represented the genitive case 
ending directly. Such a situation could have been brought about by speakers 
being awaits that these expressions should have a case ending but reanalyzing 
the ending and thinking that the case should be marked at the eno of the 
expression rather than at the end of the noun. In such a situation, speakers 
would have placed the case ending at the end cf the expression, replacing the 
original vowel of the pronoun ^hy with the ger.'itive case marker ^i. This 
use of the genitive marker could have been fostered by speakers hearing 
these expressions pronounced with ^i at the very end only — instead of 
: 1 lowing the noun itself. Such a pronunciation could easily have occurred 
if speakers elided the unstressed case vowel after the noun, e.g.: 
LL'sa^btM ii^|a?J>Ju« Then hearers could have reanalyzed . the 
_*J # at the end of the expression as a case marker. 
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Second, even if the rase endings had already been lost after nouns as 
Blau claims the phenomenon here could be explained if speakers were still 
aware of cases and knew that in CA the vowel in the pronoun suffix -hp war. 
changed to f when the construction was in the genitive case. When they 
knew the case of an expression was genitive, then even though the expression 
did not have a case ending to rnaik it- they would have changed the pronoun 
ending -hu to hi. It would have been obvious to Arabic speakers from 
the occurrence of the preposition in these examples that the genitive case 
was appropriate here, and they would have marked this case by changing hu 
lo ,-hT. 

This possibility suggests that case marking phenomena may have been pre- 
served longer when elements in the envu onment made it obvious what the case 
was. None of the four examples cited by Blau (1966 67) in which the case 
endings were lost totally (without even adjustment fot them in the pronoun 
ending) has an overt clement fe.g. preposition; to signal what the case 
should be. On the othet hand, the four examples Blau cites as exceptions to 
his thesis that cast; endings had disappeared totally in ASP fin which either 
'he case endings were preserved or the vowel of the pronoun suffix was 
changed to agree with the absent case eudingj were preceded by an overt ele- 
ment (preposition) which would signal the appropriate case. This is a small 
set of data, though, and so is only suggestive of a possibility rather than 
indicative of a probability. 

Consideration of the examples Blau gives for ?ah and ?ax extends the 
data somewhat and provides support, fot this hypothesis, which was suggested 
above by noting that the eight example.*. Blau gave fot case endings having 
disappeared altogether in ASP do not unefitiivoL.il ly support bis claim. The 
data for ?ab and ?ax do not neatly fit the pattern noted for the previous 
eight examples, but some do, and the icsl do not contradict this hypothesis. 
Of the 121 examples Hlau tfives in which a vowel different from that called 
for by the CA case system follows the noun, six nouns directly follow a 
preposition, as shown below. Since these examples <ire in Arabic script, 
short vowels are not indicated, but case endings are indicated by long 
vowels since these noun;, have atloihod pionouu .suffj.u;.. The prepositions 
and the case vowels which arc attached to the following nouiiS are underlined 
he-low. 

H) bnuwat ?ah*u-iia . . . d*>wal* nfi ?iy£Hi 

sonship falher-NOM-onr. . .call-out* particle -him 

?ab *u *na . • .w -l_ ?ab -r h 

father NOM -our. . .and to father GEN his (the last case vowel is 

correct) 

'to be sons of our father... to call him our father. . .and to his 
father- GEN' 

(2) nrt ?ab*a-hma 

with fathor-ACC their (DU) (incorrect case vowel) 

' with their father- ACC* 
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(3) m£ ?ab"i-hma 

with fat her-GEN- their (DU) (correct case vowel; from a manuscript 

which exhibits corrections) 

' with their father-GBN 1 

(A) h- ?ax-u« ha 

of-hiother-NOM-her (incorrect case vowel) 

'of her brother-NOM' 

(5) gdaiu ?nx"ii *h 

before brother-NOM-his (incorrect case vowel) 

'be fore his brother-NOM' 

(6) ytklm 5J_a ?ax-u-h ?aw 
3MASC.Sfl.IMPERF-speak gains t brother-NOM-his and 

y-din ?ax-u-h 

3MASC.SG.IMPERF-judge brother-NOM-his (incorrect case vowel) 

'he speaks against his hrother- NOM and judges his brother-NOM* 

Of these six examples which begin with a preposition, two follow the noun 
with ;X„ (which is the correct ending in CA when the noun also has an 
attached pronoun suffix), one follows with an accusative marker (a), and 
tiie other three with nominative markers (u). The two which have the geni- 
tive marker following a preposition are further examples of the correct case 
ending occurring when a preposition overtly indicates the genitive case, and 
so they also support the hypothesis that cases remained longer in this type 
of situation. The four other examples mentioned here which have the wrong 
case ending even though they are preceded by a preposition do not provide 
support for this hypothesis, but they are consistent with it. These four 
could well show that even in this situation the sense of case marking was 
also breaking down or was hieaking down for some people. 

The fact that all the other examples which Blau gives of the wrong case 
vowel being used with ?ab and Tax are instances which do not occur with a 
preposition to overtly signal the correct ending makes it all the more 
curious that the only places that the correct ending occurs in all these 
instances are those in which the noun directly follows a preposition. This 
is further support for the hypothesis suggested here. Further anal /sis of 
the data from these manuscripts needs to be done to check this hypothesis 
more thoroughly, but these examples at least indicate that this situation is 
a possibility. It should be noted that if this is borne out, it would con- 
tradict Blau's (1961: 81-82; 1966-67: AS u.49) suggestion that the existence 
of prepositions was a cause for the loss of cases because they made case end- 
ings less necessary by marking one case themselves. It seems, however, that 
if the more detailed analysis suggested here is borne out, then a rejection 
of the more general conclusion reached hy Blau would be warranted. 

There is a third possible explanation besides Blau's for the occurrence 
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of the case vowel before the pronoun suffixes and for the changed vowel in 
the pronoun suffixes even when the genitive case ending was not present. 
These occurrences might have been borrowings from CA which originally con- 
tained the genitive ending _i as well as a changed pronoun vowel, but 
subsequently lost the genitive J[ while retaining the changed pronoun vowel 
because it was fixed that way in these expiessions. This looks reasonable 
for tho four expressions that do not contain ?ab and ?ax (which Blau treats 
in a section by themselves). These foui ore Tepeated below, with the case 
marker and changed pronoun vowel underlined. 

ASP: 

( 1 ) 1>i- ?awQan* \\ ■hiin 

w i t h- i do 1 s-GES« t he jr (MASC ) 

'with their idols-GBN 1 

(2) bi-manxul-at'-i-hjm 

with- graven image ~KKM PI -GEN- their (MASC) 

'with their graven images' 

(3) lisnfb- In 

f or-peop 1 e"^-hi,s 

'for his people* 

(4) Sala xalas-^-hi 

in sal vation-fJ* Inn 

'in his salvation 1 

All of these could easily bo expressions that, were ust:d repeatedly in 
religious ceremonies and so became fixed in a form that was closer to the 
original CA than everyday ASP was. 

When the ?qb and ?ax data is considered in relation to tins possibility, 
though, it does not fit in as well as the above four examples do. All of 
these new examples are religious in nature and could easily have been 
repeated often in icligious ceremonies. Yet only two of them have the cor- 
rect case vowel. So in these examples the original CA system was not 
retained as it was in the four examples above, and it appears that these 
examples with fab and ?ax would not have been fixed in their CA form as tho 
previous four could have been. Hven if these four examples were fixed in 
their CA form, it is still curious that the noun in each is immediately pre 
ceded by a preposition. This, again, points to the reasonableness of the 
previous hypothesis. 

Whatever the reason, though, for the changed pronoun suffix in the 
expressions which do not have a preceding genitive case vowel, these expres- 
sions, along with the expressions which lost the case vowel but did not 
change (he pronoun vowel, provide evidence about a cause of the loss of the 
case endings. They show that the loss of '»»♦» «ndjjigs cannot have been due 
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just to the loss of final short vowels without something causing internal 
changes as well- as Cantineou maintained —because these words in which the 
case vowel dropped did not have the vowel at the end of the word. Rather, 
the vowel was inside the word before the pronoun suffix, where it would have 
been protected from loss due to dropping of word final vowels. Instead, the 
loss here must have heen due to either elision of unstressed vowels — dis- 
cussed in Section 4.1 above — or to a generalization from other folios. If 
this loss was due to generalization fron other forms, this suggests two 
possibilities. 

First, the generalization could have heen from pausal forms to context 
fotms — as Birkeland and Blau maintained. In such a situation, speakers 
would have realized that nouns were spoken without their case vowels when 
they were alone or at the ends of utterances, and they could have then 
started pronouncing nouns inside utterances in the same way. This would no 
doubt have been a gradual process, and so one of the last contexts for the ^ 
ending on nouns to he generalized to could well have heen that just sug- 
gested — nouns with attached pronoun suffixes, particularly nouns used in 
religious ceremonies, and particularly nouns directly preceded hy prepo- 
sitions — which overtly indicated the appropriate case. 

Second, the generalization could have come from nouns that had lost 
their case endings due to another reason, such as phonetic change or gen- 
eralization from pause forms. In this situation, speakers would have real- 
ized that some nouns which were in context did not have case endings, so the 
motivation to use case endings there would have disappeared, 8tid speakers 
could gradually have ijuit using case forms in context. Again, such a process 
would have been gradual, so that nouns in environments that obviously indi- 
cated their case, as described above, could well have been the last to lose 
thoir case endings. 

Furthermore, the data containing Tab and Tax which Blau cites also pro- 
vides evidence about a possible cause and a direction of the loss of cqsg 
endings. In this dnta, as Rlau notes, by far the prevalent wrong case ending 
is -u . which is the nominative marker. This suggests that the nominative 
form was being generalized as the form for Tab and Tax in all positions. The 
reason for this is unclear, but a reasonable possibility is that there were 
common religious phrases that it eluded Tab- u 'father-NOM' , which made the 
nominative form of Tab very frequent and the prevalent form in speakers' 
minds. If this happened, then it would have heen easy for speakers to gen- 
eralize Tabu to other positions where Tab was used. Then speakers could 
easily have extended the common use of the nominative case for Tab to the 
closely related Tax 'brother', using Taxu in most positions as well. If 
this happened, speakers could well have heen confused ahout what the appro- 
priate use of the case endings was. Such confusion could have been extended 
to the use of case endings throughout the whole nominal system, contributing 
to their loss. 

It is clear that more questions are heing raised than are heing solved 
by this examination of data showing where ASP used, used incorrectly, and 
did not use CA case endings. A number of new possibilities have been sug- 
gested, though, by this examination, showing that more infonnation can be 
leaned from the data available than has heen recognized so for by 





researchers. Several of the proposals here also show that the data may 
provide a wore coherent whole than has so far been demonstrated. For 
example, the possibility that case endings may have been retained longer 
when followed by a protective environment such as a pronoun suffix, 01 when 
preceded by an overt marker of case snch as a preposition provides an 
explanation for facts that were previously viewed as exceptions to the 
apparent generalizations. In order to settle the questions raised here, a 
wider examination must he done of the data available with the goal of 
verifying or rejecting these proposals. 

4.3. The ,Accu^sative B Case Jit J insular and .Broken PJjjral Nouns. 

4.3.1. The_Byidence 

Blau (1966 67 : 323-345) describes a number of ASP usap.es of the indefi 
nite accusative market ^n which occur both in accordance with and contrary 
to CA usage. First, in the ASP texts, accusative _an sometimes appears where 
it would in CA to mark the triptotic singulai and broken plural, ami some 
times it does not. While Blau does not discuss the frequency of occurrence 
of the accusative marker (except for adverbs -discussed below in Section 
4.3.4), he says the occurrences and nonoccurrences alternate 'freely 1 , which 
indicates that there is no apparent reason (except for adverbs) for their 
occurrence or nonoccurrence. His conclusion is that this is evidence that 
the cases have already disappeared and that ASP is a mixture of MA and CA. 

Second, sometimes accusative .an occurs m ASP texts whore it would not 
in CA. One of these instances is more common than the others— to mark 
adverbs regardless of case {discussed below in Section 4.3.4). Of the othei 
instances which are less widespread in the ASP texts, Blau notes that some 
occur in the same categories as in modern Bedouin dialects, and others do 
not. While stating that all of these usages probably ocenn ed in ASP speech, 
Blau attributes their occurrence in the ASP texts to hyper correct ion. He 
reasons that, since the most common Bedouin usage of _ an which is contrary to 
CA usage (marking indefinite nouns followed by an attribute) is not found in 
the ASP texts, then there must have been no hyper collective factors to biing 
it into the texts and that, therefore, the other occurrences of _an contrary 
to CA usage are due to hyper correction. He notes that most of these 
instances would have required the nominative case and a few of them the 
genitive case in CA, but he draws no conclusions from these obset vat ions. 

Third, in a manusci ipt from the 10th century A.D., accusative _mi is 
often used in every syntactic environment, replacing even the nominative and 
Benitivc endings that would have been appropriate in CA. Sometimes this 
manuscript also omits 7011 even where it was appiopnale in CA. Blnii states 
that this cannot be an idiosyncrasy of the manusci ipl or of the cop>ist. 
since two later (13th century) manusci ipts which are unrelated to the first 
also exhibit these characteristics. In his ear] 1 or woi k on .ludaeo Arabic, 
Blau (1065: 210 21 lj wonders whether these manuscripts teveal a situation 111 
which ^anwjn -un and m Jjri had already disappeared, but tanwin _nn could 
be used optionally in eveiy syntactic environment....' In his later work on 
ASP, however, Blau (1966-67: 340 11. 84) terms his earlier assumption 'rather 
daring 1 because it would force the postulation of 'a very intricate histoiy 
of development' of the ASP manuscript and its two 1 elated predecessois. So 
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Blau makes no conclusions about this situation, either. 

Once again, these facts point to more information about spoken ASP than 
Blau deduces. As with the data discussed above in Section 3, the data here 
do not show conclusively— contrary to Blau— that case endings had already 
been lost in ASP. The data here do show that the writers of ASP Manuscripts 
were deviating substantially from CA rules and therefore that the case system 
as it existed in CA was not in spoken use any more (if it was, the writers 
would not have deviated from it as much as they did). But this does not mean 
that case endings were completely absent in speech. As with the data in the 
previous section, this data is consistent with the possibility that some sort 
of case system or some sense of a case system existed at the time of ASP, and 
so that possibility merits consideration. 

The fact that in these writings the accusative case was used a number 
of times in place of the CA nominative and genitive cases suggests that the 
accusative case had some psychological reality for ASP speakers. That is, 
-an as ati indefinite case inaiker might have still heen in use enough that 
speakers were aware of it as a case marker and so optionally extended it to 
positions where they knew any indefinite case markers were used. This could 
have happened if the other case markers had been lost faster than the accusa- 
tive marker, so that the •others lost psychological reality as a whole hefo"e 
the accusative marker did. (The others may still have retained psychological 
reality in particular contexts where they were overtly marked, as suggested 
above for the genitive marker when preceded by a preposition.) A reason for 
the accusative marker heing retained longer than the other case markers could 
have been its greater sonority and therefore perceptibility, as a low or mid 
vowel, than the other vowels, which are high. This explanation for the ASP 
data therefore supports the theories of Blau and Cantineau that ^2 was 
retained longer than and zl due to phonetic factors. It also supports 
Cantineau' 3 and Blau's claim that at one point in the history of spoken 
Arahic. -jm was the only case marker left, while the nominative and genitive 
markers had already been lost so that many nouns had f$ endings at this time. 

Furthermore, this theory-- that the accusative marker was retained longer 
than the other case markers and was even optionally extended to the positions 
of the other markers-is appealing because it can explain some problems 
raised by Blau and can tie together some conclusions reached from the ASP 
texts. First, this possibility could explain the lack in the ASP texts of 
-an marking a noun followed by an indefinite attribute in positions that 
would have called for the nominative or genitive case in CA, without having 
to call it a 1 remarkable phenomenon' as Blau (1966-67: 329) does. If -an 
were being or had been generalized in speech to positions formerly occupied 
by only nominative or genitive markers while a sense of the case system 
still remained, use of -an to indicate that a noun in any position was fol- 
lowed by an indefinite attribute would stand out as being contrary to the CA 
case system. It could very well have heen avoided by the ASP writers pre- 
cisely because they used it in their speech but recognized it as a deviation 
from the preferred usage. One does not, then, have to conclude, as Blau 
does, that there were no hyper-correct lve factors at work on this construc- 
tion while there were on the other constructions in which -an appears con* 
trary to CA usage in these texts. While possible, Blau's conclusion seems 
unlikely, since if hyper-correction wss at woik in most of these situations, 
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it is odd that it would never be found in one of them. The other usages of 
M nn could easily have slipped ,nto the writings from speech because they are 
ies" easily identifiable as cunt tar y to CA usage since there are similai CA 
constructions which take the accusative case. In such a scenario, then, all 
the data are accounted for by the same phenomenon, rather than positing the 
existence of one phenomenon in most of the instances hut a lack of its 
existence in one situation. 

Second, this theory could explain tin.* occurrences of an in all syntac 
tic positions in tha one CA manusa ipt without having to posit a complicated 
history of this manuscript and its related manuscript*., which were wiitteu 
in the eentuiy before it was written. If the accusative ease marker gradu- 
ally spread to positions when: formerly only the nominative and genitive 
markers were used, then this manusci ipt could reflect the situation in which 
the accusative marker had finally spread to all these positions. The two 
related manuscripts which were written earlier could reflect an earlier 
situatira in which the accus tive tnarhei had not yet spread to all the other 
positions. Blau (1965: 211) assumes that the use of the accusative case 
optionally in all positions (even those in which it occurred obligatorily in 
CA) would have been the more archaic stage of these two, calling it 'the old- 
est stage of the retention of tanwjn, after the breakdown of the cose sys- 
tem of Classical Arabic..,.' But if the occurrence of ^ajn optionally in all 
syntactic positions is seen as the end of a process of the accusative case 
being generalized to other positions tathet than as the beginning of the loss 
of tauwin, then the use of the accusative case optionally in all positions 
would follow its use in some positions formerly occupied by case markers. In 
such a situation, the problematic ASF* manuscript is no longer a problem 
because its sttueture logically comes later than the structure of its chrono 
logical predecessor r. , so it can ciedihly be seen as repi esent mg that which 
it intuitively seems to represent a stage in the spoken language of ASP. 

Finally, as discussed in the ncxl section, this theoiy of the history of 
the indefinite accusative mat kei in singula* nouns and broken plurals would 
provide a unified account of the htstor> of the accusative case throughout 
the nominal systvm. 

4.3.2. The m Obi i jnue, jCose^ tjtr ou ghou t tho . Norn i n a 1 _ System 

Blau (1966 67; 2 IB 226) indicates that in ASP the oblique (accusative 
and genitive) case had replaced the nominative case of CA in dual nouns and 
in masculine sound (regular) plurals. This is shown by the very f i equent use 
of ,ayj[n^ in the: ASP texts where -ji^nj was used in the nominative of CA 
duals* as shown by the following example. 



The leplocement of the nominatjvi case bv. the oblique cose is also shown by 
the very frequent use of m \tjil in ASP text where u^na/ was used in the 
nominative of CA masculine sound pluralr., as shown by the example below. 



ASP: 

ha Vy "i - 1-naby-ayn 

this/OBL Dll-DKF-prophet 0\\L D1J 



CA: 

hatyair l nabyaii 

Ihis/KOM DU UtiF prophet NQM.UH 



'those-OBL two prophets- OBL* 



'thcse-NOM two prophets-NOM' 
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ASP: 



CA: 

y-ura-l-baran-un 

3MASC . SG . IMPERF-think-DEF- 



y-ura-l-haran-in 
3MASC. SG . IMPERF-think •DEF- 
stranger -OBL MASC pi. 



strsnger- NOM MASC PI. 



•the strangers-OBL think' 



'the strangers-NOM think' 



Since, according to Blau, these usages are so frequent, the conclusion that 
they reflect spoken ASP seems wurranted. The theory proposed here that the 
indefinite accusative marker was generalized throughout the nominal system 
for singular nouns and broken plurals could be combined with Blau's observa- 
tions -that the oblique marker replaced the nominative marker in dual nouns 
and masculine sound plurals— to yield the general hypothesis that in ASP the 
nominative case in nouns was replaced hy an ohlique case. Such a theory is 
appealing because it unifies what have foimerly been treated as unrelated 
phenomena, suggesting that ASP speakers treated the whole nominal case sys- 
tem the same way, rathar than treating its different components separately 
(excluding, 0 f course, instances of analogy which were confined to specific 
lexical items or contexts, such as that described above in Section 4.2.2 for 
?ab and ?yx). 

4.3.3. Genera H zafc i on j> f t : he 1 n 1 1 -rna 1 Oh 1 i que Marke rs 

The timing of these changes is not clear from these texts, though. As 
described above in Section 2, researchers who have included phonetic factors 
in their chronolugy of events in the loss of case endings (cf . Blau and Can- 
tineau) have considered such factors to have played a motivating role at the 
beginning of the chronology of events in the loss uf the case system. 
Because of its neatness, it is certainly appealing to assume, as Blau does, 
that (1) phonetic facturs ant! generalization uf pausal forms to context 
brought ah out the loss of final short vowels, thereby doing away with the 
case distinctions that were marked by final short vowels only; and (2) other 
cose distinctions were subsequently lost, hy analogy to the forms which had 
lust final cl le endings, sinue the reason for the case distinctions had 
become hlurred with the loss of final short vowels. 

However, the generalization of the oblique nominal case markers to nomi- 
native contexts in the dual and the masculine sound plural suggests another 
possible sequence of changes. Since the ublique markers represented two 
cnses in CA while the nominative mnrkers represented only one, speakers could 
easily have generalized the marker which represented the Neatest number of 
coses to the positions of the marker which represented fewer cases— without 
needing prior dropping uf final case vowels elsewhere in the system to blur 
the cose system and trigger these changes. Such a possibility for the begin- 
ning of the loss of case endings is supports by the fact that Blau (1965: 
127 n. 1) mentions that the oblique case occurs twice in the Qur'an for the 
nominative case in the masculine sound plural, while he states that CA pre- 
served short vowels (Blau 1961: 213, 1965; 69, 1966-67: 43), which means that 
the phonetic changes did not happen until Middle Arabic. This shows that 
the generalization of the ublique marker could occur without being aided hy 
the phonetic change. 

If the possibility suggested here had been the motivation for the los3 
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of the case distiucti' i found ly made by internal lo.ig vowels, then the cliro 
noiogy of events in the loss of nominal case endings would plate this event 
as beginning first, foil wed shortly b> the phonetic chunges nml geneinliza 
lion of pausnl forms to context, iii such a scenario, most of the separate 
events would have taken place concoi rent l> , .nut loss of the nominnl case dis 
t Mictions in the dual and the masculine sound pluial could have contributed 
by analogy to the loss of case markets at the ends of words (both with and 
without a following _n to indicate indef Muteness and defini tenets, respec 
tivcly), rathci than vice versa. In tins scenario, the whole nominal system 
would have moved slowly toward the gcliei nl i.'.at ion of oblique markers to all 
contexts, rather than changing one type of wai kei first, and later changing 
other types. The tuning of these changes may never he known conclus ively , 
but this second possibility desei ves to be consideicd with the more popular 
first possibility since these early MA documents suggest that it could he 
plausible. 

4.3.1. Accusative ^an as _nn Adverbial, Marker 

While ihscussing the occurrences of _an in ASP tests (summarized above 
in bvt M.m 4.3.1), Blau (1966-67; 323 32l7"327) notes that adverbs and adeer 
bial constructions which are accusative have tiie ending _an in ASP more fre 
quently than other types of accusatives do (although they, like all occur- 
rences of -an in these texts, are often omitted as well). Furthennore, in 
the Greek/Arabic psalm, _an occurs only on an adverb, and in two manuscripts 
^an is never omitted on adverbs but is omitted other places where it would 
have been appropriate in CA. Blau concludes that _an had been reinterpreted 
as an adverbial suffix only (one of its functions in CA) and that non- 
adverbial occurrence's of _an were no longer seen as case markers hut were 
without function. Then the non adverbial occurrences of an began to be 
lost because they had no function, while the adverbial occurrences of m jm 
were retained longer because they had a function. He states, furthermore, 
that six occurrences of jan on adveibs wheie CA would have used the nowina 
t ive or genitive show that ^an was extended to the,.e new positions because 
it was now seen as an invaiiable adveibial marker and had no function as a 
case marker. 

This conclusion that _an was seen as an invaiiable adverbial marker in 
some instances is reasonable, but the evidence does not require that this is 
the only function that __an had. Since the texts show more frequent 
instances of _an marking adverbs in positions wlieie it would have occurred 
in CA than they show .jni in any other function it had in CA, it seems that 
-on as a marker for adverbs was more salient than .-an as n marker for any of 
its other CA functions. If it had high salience as an adverbial marker, it 
would also have been easy and not surpi ising for speakers to have extended 
it to other words that became interpreted as adverbs, as two of BlnuV. six 
examples indicate. In these, CA Jiinjrt_?i»-in-CEN 'at the time when* was 
interpreted as ono word and was spelled with the ..an suffix in two manu 
scripts: bina?iS _an-ACC 'then*. So these examples do not show that .;an 
had taken on a new, invariable function, but only that one of its CA func- 
tions was still salient and productive. Theie is even an indication that 
this productivity began in late CA, so that ASP was not innovative regard inyf 
the productivity but was continuing a timid that had nlttsady started. Blau 
records that, in ASP, ?awwalan of Inn occurs foi CA ?awyala * formerly 1 , and he 
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notes (1966 67: 324 n.23) that this is attested in late CA. 

fKof BlaU ' S ° ther f ° Ur e5 ? an V 1 ? s » while I^Jng consistent with the hypothesis 
that ^an was seen as an invariable adverbial marker, also do not require this 
as the reason for their use. So they, too, do not show a need for Blau's 
conclusion that zan was seen only as an adverbial marker at the tine of ASP. 
TWo of these occur in the manuscript which uses Z an in every syntactic posi- 
tion, so the reason for the use of ^an on the two adverbs wight easily be 
that the copyist used r a|? everywhere, rather than that the copyist used -an 
to mark adverbs. The other two occur after prepositions, as shown below" 
with the prepositions and the occurrences of ^ underlined. 



ASP : 

(*) hi-yoqin-an 

with-certainty- case marker 

'certainly* 

(3) wa min baftl qalil-an 

and from after little- case m arker 

'and not long after' 

Therefore, these examples may be evidence that -an had become an invariable 
adverbial marker regardless of the case that had been required by CA. How- 
ever, Blau also lists two other examples of ^an used incorrectly (according 

lA n e ASP teXtS followin * a Preposition, when the words it is 
attached to are not advei-hs. These are listed below with the prepositions 
and ^an underlined. 

(1) ?ila wawdu9(sicj 9afi«-an wasiV-an Jiddan 
to place big -case marker wide- case marker very' 

'to a big and very wide place* 

(2) wa-kan~u ?anas min bani ?asqu yuhudiy-an 
and-was-3MASC.PL.PERF people from tribe Sceva Jew-coae^rker 

'and there were some sons of Sceva, a Jew*. 

The existence of these last two uses of : pn following a preposition show that 
sometimes ^an was used incorrectly (according to CA) without being an adverb. 
Therefore, the instances in which it was used incorrectly and was an adverb 
could have been due to the general reason that CA rules often were not fol- 
lowed, rather than to a specific change of ASP using -an as an invariable 
adverbial marker. 

Therefore, since the uses of r^n as an adverbial marker in ASP do not 
necessarily show-contrary to Blau-that ^an was seen in ASP as an invariable 
adverbial marlcer, a different explanation may provide a more consistent 
account of their occurrences. Since, as Blau notes, ^ i 8 often missing 
ASP manuscripts even when it would have been used in CA to mark adverbs in 
the accusative,, this indicates that the sense of ^en as an adverbial marker 
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was not extremely high in ASP. When considering this with the fact that -an 
appears in the ASP texts as sn accusative adverbial Marker More than -it 
occurs to mark other accusatives as it had in CA V the strongest conclusion 
that can be drawn about -an is that ASP speakers retuined a sense of it sore 
as an accusative adverbial Marker than as any other type of Marker, but tliat 
even this function was not extremely salient to then. 

Seen in this perspective, then, the use of -an to Mark adverbs in ASP is 
not very different frost its use to Mark other functions in ASP— contrary to 
Blau's claim. Therefore, this function, which Blau discusses as an exception 
to the pattern he proposes, can instead be seen as part of the general pat- 
tern proposed above in Section 4.3.2 — that ASP was undergoing tlte process of 
extending oblique Markers (including ^an) to all contexts. This would 
explain the last six of Blau's examples discussed above in which -an was used 
in non-accusative contexts — whether Marking an adverb or not. In fact, this 
explanation would provide a coherent account of all the facts about adverbial 
-an , while Blau's account raises the questions discussed above. The adver- 
bial ^au data can, therefore, he taken as additional support of the theory 
proposed here, aince they show one More way that this theory provides a 
coherent account of otherwise somewhst problematic and seemingly unrelated 
facts. 

5.0. Conclusions 

This reanalysis of data provided by Blau (1966-67) on the Middle Arabic 
Southern Palestinian Christian Arabic dialect has suggested a number of new 
conclusions about the characteristics of nouns in this dialect and the 
changes that brought about these characteristics. These conclusions support 
some previous analyses and call others into question. This study has shown 
the following: ■ 

(1) The data is consistent with Blsu's and Cantineau'a claim that a 

stress shift occurred, thereby creating a situation favorable 
for vowels to weaken and dr^p. However, there is not enough 
data to confixm this hypothesis. 

(2) Contrsry to Blau's assertion tltat case endings had been dropped 

already, the data show that case endings had only sometimes been 
dropped at the ends of nouns and before pronoun suffixes. Case 
endings had sometimes been retained in form in these .positions 
but had ceased to carry out their case Marking function. 

(3) Contrary to Csntineau and in support of Birkeland and Blau, the 

data show that loss of the single vowel case endings cannot hsve 
been due just to the loss of final short vowels— something must 
have caused internal changes as well. This could have been 
either elision of unstressed vowels or generalization of pausal 
forms to context position. 

(4) The data support Blau's and Csntineau' s assertion that the accusa- 

tive case May have been the last case lost in singular and broken 
plural nouns. It also suggests More than these theories — that 
the accusucive ending was optionally extended to the positions of 
the other case endings. 

(5) The data show that tlte nominative case May have been replaced with 

the oblique case throughout the nominal system, not just in dual 
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nouus and in Masculine sound plurals as Blau indicates. Further- 
more, generalization of the oblique case may have begun before 
final case vowels were lost. 
(6) As Blau asserts, the data show that accusative ^2" was retained 

more consistently as an adverbial marker than in its other func- 
tions. However, contrary to Blau, the data indicate that this 
was not an exceptional phenomenon l>ut that it was part of vhe 
pattern of generalizing oblique markers throughout the system 
(#4 ahove). 

One final point should be made. The change proposed here that the 
oblique case (which is often considered to be a marked case), rather than 
thy nominative case (which is often considered to he an unmarked case), was 
generalized throughout the nominal system in Arabic does not follow what has 
heen claimed to be the most usual direction of morphological change— that 
unmarked forms generally replace marked foims rather than vice versa (cf. 
Manczak 1957; and Byhee 6 Brewer 1980). The situation proposed here is 
not unknown in changes in case systems, though. For example, the accusative 
case was the basis upon which the singular paradigm was remade between 
Ancient and Modern Greek, and it was generalized i n the Romance languages as 
they evolved from Latin. Although a number of different factors influence 
the direction of morphological change— ma rkedness and frequency being very 
influential, although not always the most influential (cf. Greenherg 1966, 
1969; Matfczak 1957; and Tiersma 1982, who summarizes previous work on 
markeduess and frequency in morphological change and discusses some 
systematic exceptions), it would be reasonable for the ASP oblique 
market — which included the greater number of cases (two)— to be the one thai 
was generalized throughout the system while the nominative markei — which 
included only one case — was lost. 



Notes 



*I would like to thank Brian Joseph for his helpful comments on several 
drafts of this paper. 

1. Blau uses these terms frequently in his discussions of MA but does 
not define them anywhere. The closest he comes to an explanation is to say 
(Blau 1961: 213) that since CA preserves 'ohort vowels in open unstressed 
syllables, it seems necessary to assume a weakly centralizing stress. In 
Middle Arabic, however, stress has become strongly centralizing, as may he 
inferred from a large number of phenomena,...: final short vowels have 
disappeared...; final long ones have been shortened...; and even in the 
interior of the word short vowels in open unstressed syllables have been 
elided....' Since Blau uses preservation vs. shortening and disappearance 
of vowels to define the types of stress, I assume he means that the stres s 
is either weak (which would allow vowels to be preserved) or strong (which 
would allow vowels to be shortened and dropped), and both types al3o result 
in centralization of the vowels. It does not seem to me that he means that 
the vowel centralization was first weak and then strong. 

2. As Blau (1961; 206 207, 220; 1966: 39) notes, Fuck (1950: 5, 57-62) 
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discusses Middle Arabic briefly, but he relies on his intuitions about its 
origins, saying that the details are nearly unknown because there is no 
evidence available from that tine. The manuscripts analyzed by Blau over- 
come this problem, since they begin only two generations after the beginning 
of the Islamic conquests and so provide evidence from essentially the time 
that Arabic had the impetus from the conquests to change drastically. 

3. Blau (1961: 220, 224; 1965: 6-8) states that these early official 
Muslim papyri were probably written by scribes who weie not native Arabs, 
but that, since these scribes were no doubt from the upper stratum of 
society, they were probably imitating the speech of their Arab masters and 
so were reflecting the speech of the Arabs in the documents. He also notes 
that the few deviations from CA found in these papyri are like the devia- 
tions found in Arab poetry and badlth writings of the time. While the 
first reason is rather speculative, the other two reasons are more conclu- 
sive, and so the conclusion seems reasonable. 

4. In his summary of Judaeo-Arabic, Blau (1965) specifically declines 
to take a stand on the role that a change in stress played in the loss of 
the case endings for this dialect. However, Blau (1965: 168-169) argues 
that hypothetical phonetic laws and extension of pausal forms to context 
probably brought about the loss of case endings, with stress playing an 
important role in some of the dialects. 

5. Sivce the case vowels for these two words when followed by connected 
complements are the only case vowels that are long in CA, and long vowels are 
the only vowels that are normally indicated in the Arabic script, the vowels 
in these examples may also be the only case vowels that are indicated in all 
of the Arabic ASP writings. 

6. Sometimes a change is said to start where au element is redundant, 
since the element is apparently not needed there because its purpose is also 
indicated by another element. For example, Corriente (1971, 1973) argues 
that a cause for the loss of the Arabic case endings was their redundancy. 
(But see Blou's 1972 reply.) However, elements are also sometimes retained 
longest in environments where they are redundant — as is claimed here. In 
Greek, foi example, the infinitive is retained longest in contexts in which 
its subject is uniquely determinable (e.g. after can and begin ) and it is, 
therefore, redundant. 
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OSU WPL 34 (1986) 84-107 
ONE RULE OR MANY? SANSKRIT REDUPLICATION AS FHAGMBNTED AFFIXATION* 



0, Overview 

Linguistics, it is well-known, is a heavily comparative 
discipline. For one thing, the simultaneous comparison of various 
related languages is universally recognized as an absolute necessity 
in historical reconstruction. Furthermore, though, crosslinguistic 
typologizing of diverse unrelated languages is also now increasingly 
accepted as an indispensable step in elaborating even synchronic 
grammatical theory. There is thus ample justification for beginning 
a discussion of Sanskrit reduplication and its broader implications 
by first citing an Armenian joke and then retelling it with an Indie 
twist. 

A whole humorous literature exists of questions allegedly 
submitted to Radio Yerevan, which broadcasts from the similarly-named 
capital of the Armenian Soviet Socialist Republic. These queries 
invariably receive the response "In principle, yes" or "In principle, 
no", usually followed, though, by additional comments which have the 
effect of completely retracting the original answer. For example: 
Question— "Radio Erevan, is it true that Comrade Ivan Ivanovich won a 
shiny new Volga automobile in the last drawing of the State 
lottery?"; Answer — "In principle, yes. But it wasn't Ivan* 
Ivanovich; it was Sergei Sergeyevich; and it wasn't a new Volga, it 
was an old bicycle; and he didn't win it, it was stolen from him 
while he was inside buying his ticket." Hence, further, along 
similar lines: Question— Radio Yerevan, is it true that Sanskrit 
reduplication involves only a single, straightforward rule whose 
elegant autosegmental treatment as just a special subtype of 
affixation supports the crosslinguistic validity of such an 
approach?"; Answer— "In principle, yes. But Sanskrit reduplication 
isn't one simple rule; it's a collection of many heterogeneous rules, 
with varying degrees of complexity; and its resemblance to 
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nonreduplicatiye affixation isn't confined to overall formal 
similarity; this also extends to their parallel morpholexical 
fragmentation, as in numerous other languages; and, finally, Sanskrit 
reduplication doesn't always involve copying; sometimes it is so 
prototypical ly affixal that it isn't really reduplication at all." 

The basic conclusions adumbrated above in the foregoing vein can 
now (with all due sobriety) be reformulated and summarized as follows: 

I. Sanskrit reduplication is not a single rule, but a 
constellation of several distinct rules. 
II. These rules are best analyzed as parallel to those for 
nonreduplicative affixes. 
III. The above analysis is motivated not only by evidence from 

individual stages of Sanskrit but also by the considerable 
historical evidence pointing to increased fragmentation of 
reduplication over time. 

IV. Fragmented reduplication— and lexical particularization of 
morphological rules in general— is not limited to Sanskrit, 
but appears to be the cross-linguistic norm. 

These conclusions are supported by a solid body of general and 
specific evidence, which we present in the sections below. 

1. On Reduplication in General 

The overall phenomenon of reduplication has lately been the focus 
of intense investigation. Building on earlier findings by Wilbur 
1971, Moravcsik 1978, and Carrier 1979, recent work by McCarthy 1979, 
1981, Marantz 1982, Broseiow 1983, Broselow & McCarthy 1983, and 
numerous others 1 , has helped bring this process into the forefront of 
current research on multilinear phonology and morphology. However, 
the facts of reduplication in Sanskrit— though well-described and 
readily available ever since the appearance of Whitney's classic 
grammar in 1889— have received surprisingly little attention in the 
aforementioned literature. Similarly, the separate body of 
contemporary research devoted to Sanskrit linguistics has treated 
reduplication in that language only tangential ly, mainly in 
connection with discussions of Grassmann's Law and related issues. 2 

Here, however, Sanskrit reduplication constitutes the central 
focus. Quite apart from the challenge it presents for 
language-particular description, this phenomenon bears directly on a 
number of significant issues in morphological theory. These 
include: (i) the degree of homogeneity shown by reduplication 
processes in individual languages, (ii) the characterization of 
reduplication as affixal or not, (iii) the nature and variety of 
affixal templates, and (iv) the relative value that grammars should 
place on semantic specificity as opposed to phonological generality 
in morphological and lexical rules. 

The most important theoretical finding to emerge from this study, 
however, is that Sanskrit reduplication requires the adoption by 
morphological theory of a new construct, the rule-constellation. This 
construct can be defined as a group of formally similar morphological 
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processes sharing at least one characteristic property of fona but 
distinguished by individual formal idiosyncrasies which prevent their 
being collapsed with one another. This concept is reminiscent) as a 
formal inveise counterpart, of the functional notion of 
rule-conspiracy in phonology, and it also recalls the notion of 
sloppy identity in the syntactico-semantic sphere of anaphora. For 
Sanskrit, the rule-constellation of reduplication involves several 
word* format ion processes which all indeed have in common a partial 
prefixal template, but each of which additionally requires a unique 
set of further template-material and segmental prespecifications. 

Equally important, though, is the related finding that the 
rule-constellation of Sanskrit reduplication reveals a preference on 
the part of speakers for fragmented morphological and even lexical 
processes — ones which are semantical ly specific at the expense of 
phonological generality, rather than phonological ly general at the 
expense of semantic specificity. Since the same preference for 
morpholexically particularized word-formation rules is exhibited in 
numerous languages other than Sanskrit, grammars in general roust be 
constructed so as to reflect this preference. 

Such conclusions thus provide a kind of hack-handed support for 
the autosegmental-morphological view of reduplication as affixation 
that is currently in vogue. On purely formal grounds, there can be 
little disagreement that reduplication — however analyzed — qualifies 
as an extreme subtype of context-sensitive morphological addition, 
and hence as affixation. But the prevalent autosegmental 
justification for such a treatment is that it obviates the need for 
transformational foimalism and so permits an economical and elegant 
treatment of reduplication which can easily be assimilated to that of 
straight foiward rules of garden-variety (non-reduplicative) 
affixation. Based on the evidence from Sanskrit, however, it seems 
that reduplication and nonreduplicative affixation also show a 
striking similarity to each other precisely in their apparent 
inelegance. That is, the morpholexical fragmentation of 
reduplication in Sanskrit and other languages is exactly paralleled 
by the existence of fragmented morphological rule-constellations 
clustering around the unquestioned affixes of, again, Sanskrit and 
other languages. A most convincing piece of evidence for this 
parallelism is discussed below (section 5), in which a Sanskrit 
reduplication-snbrule was reanalyzed as a rule affixing an invariant 
prefix — certainly the ultimate in the intersection of reduplication, 
fragmentation, and prototypical affixation. 

2. Evidence for the distinctness of the Sanskrit reduplication rules 

Reduplication is found in a large number of formations in 
Sauskrit, within both the nominal and the verbal systems. Attention 
here is focussed on reduplication in the verbal system, where five 
categories of stems — present, aorist, perfect, desiderative, and 
intensive— all show reduplication. 3 If it were the case that all 
such formations involving reduplication behaved alike with respect to 
a variety of grammatical phenomena, then one would he justified in 
speaking of "reduplication in Sanskrit" as if it were a unified 
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process. However, such is not the case, for there are in fact many 
significant formal differences in the various ways that reduplication 
manifests itself in the language. Taking note of these differences 
is the firot crucial step in demonstrating that Sanskrit 
reduplication is indeed a "rule-constellation". 

For example, the prosodic template associated with reduplication 
formations is most commonly CV-, as the underlined elements in (1) 
show: 4 

1. ta-tap- (perfect stem of tap- 'heat*) 
vi-V8k- (present stem of vac- 'speak*) 
du--dru- (aorist stem of dru- 'run') 
su-sut-sa- (desiderative stem of s'udh- 'purify') 

However, there are several other forms that this template can take. 
Thus, in addition to the CV- type in (1), there is also (treating 
long vowels (V:) as VV): 

2. a. V-: e.g. u-va:c- (perfect stem of vac- 'speak'), i-nak-sa- 

(desiderative of nas- ' attain' ) & 

b. VV-: e.g. ajv- (perfect stem of ay^ 'favor') 

c. VC-: e.g. aj_n-ams- (perfect stem of 'attain'), 

am- am- a- (aorist stem of am- ' in.iure' ). al-ar- 

( intensive stem of r^ 'go'), iy-ar- (present stem of 

£z 'go') 6 ^ 

d. CVV-: e.g. vaj.-vac- (intensive stem of vac- 'speak'), 

jai-jval- (intensive stem of .ival- 'burn, flame'), 
mu: -muc- (aorist stem of muc- 'release') 

e. CVC-: e.g. bad-badh- (intensive stem of ba: dh- 'oppress'), 

dan -d ah- (intensive stem of dah- 'burn') 

f. CVCV-: e.g. kari-kr- (intensive stem of kr- 'make') 

g. CVCVV-: e.g. gani:-gam- (intensive stem oT "gam- 'go'), 

mari: -mar- (intensive stem of mr- 'die'). 

Admittedly, some of these shapes are restricted to particular 
categories; e.g., CVCV(V)- (as in (2e/f)) is found only in the 
intensive formation. And there are, to be sure, predominant patterns 
in any given category (e.g. CV- in the perfect, present, and 
desiderative), and some evidence of regularization of irregular 

formations, e.g. the Vedic perfect stem of vap- 'strew' u-va: p- > 

later va-va:p-. However, the existence of these divergent shapes 
cannot be discounted, and they cannot be assigned to just one type of 
category; rather within each verbal grammatical category with 
reduplication, a number of template shapes are to be found. Thus, 
the evidence of the variety of prosodic templates used in categories 
with reduplication supports the notion that reduplication in Sanskrit 
cannot be viewed as a unitary process with a single template valid 
for all reduplicative formations. 

A second feature which differs in the various reduplicative 
formations is the feature prespecification for the consonant(s) in 
the reduplication syllable. The predominant pattern is for these 
consonants to be (-aspirated, -back], so that a velar in the root is 
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reduplicated as a palatal and an aspirated consonant is reduplicated 
as a nonaspirated consonant. This pattern is illustrated in (3), and 
note especially (3e) where it is the second consonant that is 
redup lice ted as [-aspirated]:' 

3. a. jar gam- (perfect stem of gam-, 'go 1 ) 

b. bi-bhed- (perfect stem of bnld- 'split 1 ) 

c. ji-ghra:- (present stem of ghra:- 'smell 1 ) 

d. ji-ga:-sa- (desiderative stem of tfa:- 'go 1 ) 

e. had-hadlv* (intensive stem of he : dh- 'oppress 1 ). 

However, there are also a few formations in which [+aspirated] and/or 
[+back] consonants appear in the. reduplication syllahle(s) . For 
example, the Vedic subtype of the intensive formation with disyllabic 
reduplication docs not follow the predominant [-aspirated, -hack] 
pattern seen above in (3):& 

4. a. gain. gam- (intensive stem of gam- 'go 1 ) 

b. bhari : "bhr- (intensive stem of bhr- 'bear 1 ) 
c ghani :_-ghan- (intensive stem of han^ 'smite')* 

Moreover, in later Sanskrit, there is a reduplicative adverbial 
formation which allows [^aspirated] consonants in the reduplication 
element, e.tf. rat ha: rathi 'chariot against chariot' (cf, ratha- 
'chariot'). 9 

Thus, reduplication syllahles do not all reflect a uniform 
consonantal prespecif i cat ion in their templates. 

Nor is it the case thrt reduplication syllahles follow a uniform 
template prespecification for vocal ism, giving yet another reason for 
treating the various reduplications in Sanskrit as formally distinct 
processes. In particular, there is no constant vowel quality or 
vowel length across all reduplication rules; the examples in (1) 
through (4) above show a, i t u, aj., uj_, and a — ij. in the 
reduplication syllable(s), and other vocal isms are to he found as 
well: 

5. a. biM>lia:y- (perfect stem of bhi:- 'fear') 10 
I). ti:-tap- (aorist stem of tap- 'heat') 

c ne-nij- (intensive stem of ni j- 'wash') 

il. bo-bhu:- (intensive stem of bhu:- 'become')** 

Thus, it clearly is not possible to state a single pattern for the 
vocal ism of the reduplication syllable(s) that is valid across all 
reduplication types in the language. However, each reduplication 
rule does have one most common, unmarked value (and a variety of 
marked values) for the length and quality of the "rhyme" of the 
reduplication syllable, e.g. i in the desiderative and present, i± in 
the aorist, a copy of root vowel in perfect, and a heavy 
reduplicative syllable (e.g. a long vowel or diphthong, though 
CVC(V(V)) is possible too) in the intensive. The examples in (1) 
through (5) above illustrate these categorially-based uniformities, 
as well as some of thi* marked divergences within each category. Such 
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a situation alone points to at least five distinct reduplication 
rules for the verbal system, and the marked subvarieties way well 
give evidence for the need for further fragmentation in the 
description of reduplication in Sanskrit. 

One additional striking difference in the various reduplications 
lien in the placement of the reduplication syllahle. In particular 
the reduplication syllable is mostly prefixed, as in all the examples 
ahove,i- but l^re is a subclass of desideratives and another of 
aorists (both formed from vowel-initial roots) in which there is 
internal reduplication, with a -jCi- reduplication syllable being 
infixed before the final consonant of the root. A few examples of 
this small but mildly productive class are given in (6): 

6. a. e-di-dh-isa< (desiderative stem from edh.- 'thrive') 

b. n:-pi-p- a - (aorist stem from a:p- 'obtain') 

c. e-di-dh-a- (aorist stem from "edj^, cited only in native 

grammatical literature) 

d. ar-ji-h-isa" (desiderative stem from arh- 'deserve', cited 

only in native grammatical literature). 

Certain of these forms, especially those with no change in the 
reduplicated consonant, e.g. a^ina.:, could even be considered to 
have reduplicative suffixes (i.e. with an analysis (a: P »i £ -a-] ) , 
though the clear cases (where there is a change in the reduplicated 
consonant) seem to have internal placement of the reduplication 
syl able.^a While this type probably arose by a reformation of an 
earlier form with prefixed reduplication (perhaps *ul-idh-) to 
est±4hz by analogy to the root vocalism^, this i atter fonn adjftits of 
synchronic analysis into a discontinuous root e^.dh- with infixed 
reduplication (.-jJi^) . The fact that this pattern was also extended 
to other such roots suggests that this is the analysis that (at least 
some) speakers actually made. 

A final difference among the various reduplications in the verbal 
system of Sanskrit concerns certain root idiosyncrasies associated 
with reduplication. In particular, five roots show a "reversion" of 
the root-initial palatal to a velar consonant in various 
reduplication categories, but this reversion is not found uniformly 
across all the categories for those roots. For example, while the 
reversion always occurs in the desidernUve, it otherwise is 
scattered across the remaining categories. The following is an 
(attempt at an) exhaustive listing of the relevant forms, grouped 
according to root, which show reduplication categories where 
reversion occurs and, where tiiis ca:» be determined, those where it 
does not: 11 



7. a. ci- 'note': ci *ke (present stem), ci-ki^-sa- 
( desiderative stem), ci-ka^v- (perfect stem) 
£Ur 'perceive': c.i-ket- (perfect stem), ci-kit-sa- 

(desiderative stem), ce-kjt- (intensive stem),* but, cited 
by native grammarians: ci:-eit- (aorist stem), ci-crt- 
(alternative perfect stem) 15 
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c. Ji; 'conquer 1 : ji- ga:y - (perfect stem), ji-gi:-sa- 

(desiderative stem), but jit-jay- (aorist stem), je- ji:y- 
(intensive stem, from native grawonrians) 

d. hi^ 'impel': ji-ghy-a- (present stem) 15f \ ji- ghi: -sa- 

(desiderative stem, from grammarians), but ji:-hay-a- 
(aorist stem, from grammarians) 

e. han- 'smite': ji:- ghan- (aorist stem), jan-ghan- 

( intensive stew), ja- ghan - (perfect stem).* 0 

Taken together, then, these facts concerning formal differences in 
the various manifestations of reduplication in Sanskrit point clearly 
to the conclusion that it is misleading to speak of reduplication in 
Sanskrit as if it were a unitary process. Instead, a good many 
reduplication subrules are needed — for observational as well as for 
descriptive adequacy. 

3. Evidence for the Clustering of the Various Sanskrit Reduplication 
Rules 

Despite the conclusion just drawn from the facts in section 2, 
there are, nonetheless, some striking waya in which the various 
reduplication rules are formally similar. These facts constitute the 
second crucial step in demonstrating that the reduplication rules 
for* a rule-constellation, since they show that the rules have some 
formal properties in common. 

The first such property is a trivial one, but must be mentioned 
nevertheless. As is clear from the examples in (1) through (7) 
above, all reduplication templates contain at least a vowel. 
Moreover, in a fully autosegmental analysis, they would all be marked 
with the feature [-(-reduplication]. 

There are, however, more significant common features. In 
particular, all reduplication rules show the same tegular contrast in 
the differential copying of root consonant ism with initial sibilant 
(s> 5> §, though this last involves a perhaps somewhat nonstandard 
use of the term sibilant) clusters. Where the second segment in the 
cluster is a stop, only the stop is copied, i.e. #S(ibilant) + 
(s)T(op)...- — > T-vowel -ST. but where the second segment is a 

resonant, the sibilant is copied, i.e. #S + R(esonant). . > 

S-vowel -SR.. .-. Examples of the stop-type are given in (8a); 
examples of the xesonant-type, in (8b): 

8. a. stha:- 'stand' — > ti-stha- (present stem) 

spis- 'touch' — > pja-sprs- (perfect ste»a), p_i-sprk-sa- 

(desiderative stem) 
stu- 'praise' — > tu-stav- (aorist stem) 
sthi: v- 'spew' — > te-sthi:v-/te-sthi:v- (intensive stems, 
/ from grammarians) "* / 
scut- 'drip' — > cu-scot- (perfect stem), cu-scut-a- 
/ (aorist stem) / 
D « sru- 'hear' — > si-sra:v- (perfect stem), su-siuv- (aorist 
stem) 

smr- 'lemember' — > sar-smt*- (intensive stem), su-smu:r-.sa- 
(desiderative stem), both from grammarians. 
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This is the one significant formal feature common to all 
reduplications in Sanskrit without any exceptions. 16a It is 
important to stress "formal" here, for it is the case that in 
general, reduplication is not unitary within categories from a 
functional and/or semautic standpoint. With the exception of the 
intensives, whose steins always have reduplication of some sort, there 
are nonreduplicative formations to he found in each of the categories 
that show reduplication.*' By its unique coinmonoJity to 
reduplication, this feature gives some unity to what is otherwise, 
from a formal standpoint, an assortment of numerous different 
rules. A'a Still, given the rather large number of featuies on which 
the reduplication rules disagree, it seems best to conclude that they 
do indeed form a rule constellation, united primarily in the way they 
treat sibilant clusters and related in the fact that they all involve 
at least a vowel prefix, but distinct nonetheless in their behavior 
with respect to a wide variety of formal aspects. 

4. ContrastjwUh.^ff_r Ana.l^;ses_wi th HttJe_.qLj_io. Recogn 1 1 i on of 
Fratfl n entation " " 

While Sanskrit reduplication has heen mentioned quite extensively 
in the generative literature (s«e footnote 2), the view of Sanskrit 
reduplication taken here is an entirely novel one. One notahle 
exception is Jchindler 1976, which talks (p. G27) of the remnants of 
Grassmann s Law in Sanskrit as "one of several morphological rules 
that apply ... (inl redu- plication". I 8 For the most part, previous 
researchers in the generative framework have either acted as if the 
reduplication rules in Sanskrit were a unitary process, or as if 
there were at least unity within categories. Thus, Sag (1976, p. 
617) gives "the reduplication rule" as: 

9 - (fRO0T)_ C ^ V X 

3 - 4 — 2-3-1-2-4. 



i 'r 2 I. 

I <velaH 



r palatal^ 
•asp | 
etc. J 



Similarly, Cairns & Femstein (1982, p. 210>1), following Kiparsky 
(1979, p. 434-5) declare that "the Sanskrit [reduplication] rule 
will have the foim:_ copy Mc " [= margin core of root syllable's 
onset], and barantz (1982, p. 448-9n.9) speaks simply of "Sanskrit 
initial reduplication". Anderson (1982, p. G02), on the other hand, 
implicitly recognizes categorial differences in reduplication, hut 
nonetheless gives a single rule for "the reduplication in ... ("the 
perfect stem"]": 



10. 



rVverb 1 
[/PerfectJ 

f" *coronaf) s [ -s 
|^ +cont J ^ <fobatruent>| 



/< p* corona l"]^ [" -syllabic "] C Q (a) {^syllabic) X / 

2 J 3 4 5 6 

/ 2 5 1 2 345 6 /. 
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Siwilarly, Borowsky & Mester (1983, p. 53) refer to the "the 
formation of the perfect (in Sanskrit) ... by prefixing a template 
CV- to the root and copying and associating the segmental melody", 
though recognizing some categorial differences by referring (p. 
61n.2) to "[some] intensive forms (which) involve reduplication of 
the entire root morpheme". To a certain extent, these analysts were 
simply giving the unmarked or predominant formative process in each 
case, but such oversimplifications dangerously obscure the actual 
quite fragmented picture. As noted already (and in the next 
section), this fragmentation is to be expected, given the affixal 
nature of reduplication, so the contrast here is not just one of 
detail but rather one of substance. 

5. Parallels Between ^duplicative and Nonreduplicative Sanskrit 



Calling reduplication affixal in natuie means treating it as not 
distinct in any meaningful way from the (nonreduplicative) affixes of 
the language. This view has been argued for by Marautz 1982 for 
reduplication in human language in general, and it certainly holds 
for Sanskrit, based on both synchronic and diachronic facts which 
show that a number of parallels obtain between (undisputed) affixes 
and reduplication in Sanskrit. 

From a synchronic standpoint, there is considerable motivation for 
treating reduplicative elements as a type of affixation. Most 
importantly, doing so fills out holes in the distribution of both 
reduplication and (nonreduplicative) affixation. That is, while 
there are numerous (nonreduplicative) suffixes in Sanskrit, there is 
but one grammatical prefix, the past tense marker *z ( tne so-called 
"augment", see footnote 12). Similarly, while the placement of the 
reduplication element is mainly prefixal, one subpattern of the 
desiderative and aorist subtypes noted above in (6) — namely those 
forms that show no change in the reduplicated consonant (e.g. 
n:pii>-a- )~ -admits of analysis as having reduplicative suffixes. Thus 
by treating reduplication as a type of affixation, the one otherwise 
anomalous grammatical prefix, the augment, ceases to an irregularity, 
and the one type of anomalous suffixal reduplication likewise is no 
longer irregular. Moreover, it can be noted that there are both 
reduplicative infixes, as in the type of (6) with changes in the 
reduplicated consonant (e.g. e~di-dh-), and nonreduplicative infixes, 



29 roots, including those in (ll): 19 

11. a. yuj- 'join- — > present stem yu-na-j- 
("strong")/yu-n--j-("weak") 

b. r udh- 'obstruct* — -> present stem ru-ya-dh- 
"* ("strong")/ru-n-dh-( M weak") 

c. chid- 'cut off -* -> present stem chi-na-ri- 

("strong")/chi»n-d-("weak"). 

In terms of the distribution of their placement with respect to 
roots, thfin, an affixal treatment of reduplication serves to 
eliminate irregularities both in reduplication and in the 
(undisputed) affixes. 



Affixes 




forms the present stem of t some 
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From a diachronic standpoint , though, the evidence is even 
stronger, for many of the things that have happened to affixes in 
Sanskrit have also happened to reduplication syllables. This 
parallel behavior suggests that, speakers treated the two alike. 

For example, both affixes and reduplication syllables sporadically 
underwent a Joss of their identity due to their reanalysis as part, of 
a root. Thus the synchronic root Cpinv- 'fatten 1 (evident in, for 
instance, the perfect stew pi-pjnv-) represents a reanalysis of an 
earlier present stem from the root pi(:)- 'swell, fatten 1 formed with 
the often factitive suffix ^iu-, i.e. *p7-nu--. Similarly, the 
originally reduplicated intensive stem .iai-gr- 'wake 1 was reanalyzed 
as an indivisible root ja:g£., which is evident, for instance, in the 
1SG present ja:gr-mi.20 and in nominal derivatives such as 
ja: dar-akn- 'waking*. 

Another development found with both offixfs and reduplications 
involves the obscuring of original boundaries and distributions, in 
what may be called accretions and extensions. Typically, these 
happen by some type of reanalysis. Thus, the locatival adverbial 
suffix -ta: t . added more or less pleonastical ly to other adverbials, 
e.g. pra:k- ta: t 'from the east' (cf. pra:nc- 'forward, east'), 
puras-taj_t 'before; in/ from the east' (cf. pur a s 'in front, 
forward'), was resegmented to ^sta^t, presumably in forms such as 
purastn :t, and then extended to other forms, e.g. uparj- sta: t '(from) 
above* (cf. npari 'above'). Simlarly, the "union M -vowel i/i7 was 
originally part of roots (due to the Indie treatment of 
Proto-Indo-European root-final laryngeals) but came to be considered 
part of adjoining suffixes, creating virtual alloworphs of the 
suffixes, so that the agent ive rtr- gained the allomorph -itr-, the 
desidcrative -sa- gained the allomorph -±sa-, etc. In somewhat 
parallel fashion, the reduplicative intensive prefix with CVR- shape 
that regularly occurred only with roots containing a resonant was 
extended, with an -n- that was originally proper only to roots with a 
nasal, to other roots, e.g. .jafl-gah- (intensive of gn:h- 'plunge', 
and cf. the alternative intensive stem with no final C- in 
reduplication, ja: -ga:h-. cited only in the grammarians) . Moreover, 
at some point in the development of the intensive reduplicative 
prefixes involving reduplication of the whole root, an ii t of 
somewhat uncertain origin, accreted onto the reduplicative, prefix, 
giving forms such as hhari ^-hhr- (from hhp- 'hear'), and ultimately 
becoming part of a disyllabic snbpattern for intensivos (see also 
footnote 8). 

Most significant, though, for the view advocated here is the fact 
that, in at h»ast one instance, a reduplication syllable, even though 
its connection to the root was reasonably transparent ■ was reanalyzed 
as an affix: Sin-ams-, the perfect stem of asj; 'attain', and a_^n- 
an£~, perfect stem of uiVj;. 'anoint', served, via the identification 
of the a:n- as merely an affix, as the basis for highly anomalous 
perfect stems of other roots with initial a.; or £*, e.g. ajn rdh-, 
perfect stem to rdh - 'thrive', a:_n-rh , perfect stem to a_rjv~ 
'deserve', etc. The fact that a reduplication syllable could move so 
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easily to an existence as an affix suggests again that reduplication 
is in actuality a type of affixation. 

Given these facts about the parallel diachronic behavior of 
reduplication syllahles and affixes, and given the synchronic 
motivation for treating them in parallel fashion, one can 
legitimately question even calling the phenomenon "reduplication" in 
all instances. Especially in the case of the a: n -rdh-/ aj_n-rh- type 
of perfect (cf. above and (6)) and of such irregularities as perfect 
stem ja-bhar- (versus expected and later-occurring ba-bhar- ) from 
hhr- 'hear', there seems to he no reason to speak of "reduplication" 
except in order to stress a parallel with other formations in the 
same category. Thus, some context-sensitive prefixation probably is 
not reduplication at all. Moreover, if one speaks instead of 
"affixes** in these and the other cases, then a: n-rdh- (etc.) can be 
said to contain a perfect affix that happens to have no direct formal 
connection with the root it attaches to, while a more regular 
formation such as ta-tap- contains one that does have such a direct 
formal connection. 

Sanskrit reduplication, then, is not only best treated as a 
fragmented constellation of related morphological processes, but 
further, these proceses are best taken as affixal in nature. 

6; D iachronic Fragmentation of Morphological Rules in Sanskrit 

The rule constellation of Sanskrit reduplication not only is 
synchronically fragmented but also can be shown to have arisen via 
the diachronic fragmentation of an originally more unitary 
situation. This conclusion rests on a mass of philological evidence 
that, can only be summarized here. 

Sanskrit is unquestionably a historical development of 
Proto-Indo-European (PIE), and the source of Sanskrit reduplication 
is ultimately to be sought in this proto-language. Parallels for 
each of the Sanskrit reduplicated categories are to he found in other 
Indo-European languages. As a result, the proto-language is 
standardly reconstructed (as in Meillet 1964) as having virtually all 
the reduplicated (verbal) categories found in Sanskrit. However, the 
standard reconstruction (Meillet, pp. 179-182) also shows greater 
unity within each of these proto-categories with reduplication than 
is found in Sanskrit; for example the vocalism in the perfect's 
reduplication syllable is *e, while that for the present is *i, etc. 
(compare (5)). Similarly, the highly particularized forms such as 
u-va:c- (as in (2a)), a. n -r dh- (see section 5), ja-bhar- (see section 
5), and others, are not reconstructive as such for PIE. Thus, in 
the development of Sanskrit from PIE, a diachronic fragmentation of 
reduplication occurred. 

Moreover, Sanskrit is attested over a long enough period of time 
that it not only has a previous history but also an internal one. 
And, within Sanskrit, idiosyncratic forms such as u-va:c~, aMi— rdh-, 
ja-bhar-, etc. can be explained only as particularized replacements 
for more regular forms: u-va:c*- through the lexicnlization of a. 
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former sound change grown opaque (one deleting *v- before u), 
a.Mrrdh~ via reanalysis and analogy (see section 5), and ja-bhar- 
probably via contamination. Furthermore, the fragmentation of the 
rules for nonreduplicative affixes in Sanskrit (as already noted in 
section 5), can also be shown to be a historical innovation. Thus 
the Sanskrit -internal evidence of the development of reduplicative 
and nonreduplicative affixes shows speakers to have exhibited a 
preference, in many cases, for fragmented morphological rules and 
processes--i.e. for rulcconstellations. 

Actually, though, the motivation for this conclusion can be shown 
to be much stronger and even more compelling, once the perspective is 
widened to include more languages than just Sanskrit. 

7. Morphological Fragmentation as the Crosslinguistic Norm 

If the extreme morpholexical part icularizat ion of reduplication 
found in Sanskrit were a completely isolated case, one could perhaps 
attempt to counter the appaient need for a morphological construct 
like the rule- constellation by claiming that the Sanskrit phenomenon 
in question represents merely an accidental and/or highly marked 
situation. But fragmented reduplication is in fact found in so many 
languages that any such line of resistance clearly is totally 
untenable- In every langauge known to us which utilizes 
reduplication to mark either a single grammatical 1 y-centr a 1 
morphological category or else several morphological categories 
(whether central or more peripheral), this functional importance 
and/or variety is always accompanied by at least some degree of 
morpholexical ly-particulari zed formal fragmentation. Thus, 
reduplication seems to be a rule-constellation, not only in Sanskrit, 
but also in Kfhehe fOddeii & Odden 1985), Madurese (Stevens 1986), 
Tagalog (Carrier(-Duncan) 1979, 1984), and many other languages too 
numerous to discuss or even list here. Furthermore, even in 
languages where reduplication plays a rather minor role (in terms of 
functional variety and centrality;. there is still usually a 
considerable amount of formal differentiation, as shown for instance 
)>y the contrast in English reduplicative (or at least 
reduplicativoid) fonns such as higaledy pi^ledy versus flim-flnm 
versus din-di n, etc. 

In order both to cement the crosslinguistic validity of this 
overall point and to stress that it is not always clearly brought out 
in the literature, the treatment of a particular language can be 
cited, taken from one of the most influential recent articles on the 
importance of reduplication for morphological theory (Marantz 1982, 
pp. 474-475). After first introducing "Tagalog reduplication M as if 
it were a single general phenomenon, the discussion then mentions 
that Tagalog really has at least "three different sorts of 
reduplicatiun". Finally, a footnote reveals that, even though the 
analysis sometimes proceeds "as if th? various reduplication prefixes 
... arc each single, uniform morphemes ... f — ajctually, ... each 
prefix has a variety of uses ... (so that] each must be understood as 
the morphulogicnl form of a set of homophonuus morphemes . " Here, 
too, then, the notion of reduplication as a rule-constellation is 
arguably present, implicitly lurking just below the surface. 
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Moreover, synchronic and diacharonic fragmentation of 
nonreduplicative affixation— in fact of morphological processes in 
general — is also extremely coimQon crosslinguistically. Two 
straightforward cases from English can be adduced involving 
adjectival suffixes. The alternation between ^al and z§r was once 
phonologically conditioned, hut recent pairs such as line-al/line- ar 
and fami li-al / fami li-ar show that the -al/-ar contrast no longer 
represents allonorphy, but instead two separate nearly-horaophonous 
suffixes clustering as an affixational rule-constellation around the 
formal core Vowel + Liquid. Likewise, such disparate forms as 
drink-ab le, pot-able , coinfort-able . and surviv-able (as used of 
nuclear weapons which are — unfortunately— intended to survive, rather 
than be survived) demonstrate that there are now several homophonous 
affixes - able . That is, the different raorphosyjitactic conditions 
embodied in their various foiiaal statements prevent them from being 
collapsed with one another (see also Aronoff 1976). German similarly 
has a two-element rule-constellation for adjectival suffixes 
expressing material composition (e.g. Seide/seid-en 'silk '/'silken* , 
with -lejn, versus Stein / stein -ern 'stone'/'stony' , with - ern . where 
final -n is shared), and at least a three-part rule-constellation for 
agent ive suffixes (e.g. Dien-er 'server, servant' versus Tisch-ler 
' tahle-maker, cabinet-maker' versus Red-ner 'speaker, orator'— where 
final rer is shared). 

Nor are nonaffixational cases of morphological rule-fragmentation 
hard to come by. For example, a phonologically rather arbitrary set 
of tone-substitution processes in Copala Trique (see Janda 1982a, 
Hollenbach 1984, and references there) performs the three functions 
of deriving adjectives from nouns, inflecting nouns for 
possessedness, and inflecting relative clause verbs for continuative 
aspect. In addition, a subtractive process of final vowel deletion 
in Rotuman marks the "incomplete phase" (see e.g, Janda 1983b, 1984, 
and Hoeksema & Janda 1985), hut this category turns out to he merely 
a convenient cover-term for a set of uncollapsible distinctions 
including indefinite nouns, verbs in the imperfect ive aspect, 
emphatic words, and nonfinal elements in a noun phrase. In fact, the 
same farrago of categories are all arguably sometimes marked hy 
morphological metathesis in Hotuman, too, and a similar process of 
permutation is involved in a rule-constellation of Clallam (see the 
references noted above for Rotuman). Surely the Most extensive (and 
hence most fragmented) rule-constellation currently known, though, is 
instantiated by Modern High German Umlaut (see Janda 1982a, 1982b, 
1983a), which has been morpholexically particulai ized so severely 
that it not only occurs alone in six different rules, but also occur* 
with thirty distinct phonological shapes of affixes, which themselves 
represent at least twice that many morphological rules. In total, 
then, the Umlaut constellation demonstrably involves between sixty 
and seventy morphological rules, nost of which share an identical 
formal core, but some of which are strikingly different in their 
structural descriptions. 

In light of the ev'ioence just presented, it can thus he said, 
without exaggeration, not only that fragmented reduplication is not 
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iShmff t f ° SanSk I it| bu * "I 30 that -orpholexical fratftentation-uf 
nro^i^! -° n : ° f ^reduplicative affixation, an d of morphological 
ll° n T*S ln « e "eral--is indeed the crosslinguistic norm, both aa a 
synchronic state and as a diachronic change. Such states and changes 
oart J inte ' preted to "fleet a strong and constant tendency on the 
part of speakers to particularize (formerly) no re general 
morphological processes as markers of no re specific lexical and 
grammatical categories. That is, given the notion of the 
morpholexical rule-constellation as a way to express the unity of 
similar rules even in the face of their formal diversity, what 
emerges as the dominant and driving force in creating such 
constel ations is the lexico-semantic motivation of speakers: the 
high value that they seem to place on the unambiguous and even 

transmission of information about specific meanings as 
expressed by particular lexical ite»« (mainly morphemes and words) 
and classes of lexical items. For example, the occurrence of the 
usual German agent ive marker zor is far less revealing about what 
stem precedes it than that of its co-members in the constellation 
(its co-3t a rs J-ser and ^ler, precisely because the latter have a 
wnfch*^ lacKs) distribution < a " d al3 ° stress certain connotations 

* Jl a P^ ears that the morpholexical fragmentation at issue here is 
fed by three main sources (although a full discussion of such topics 
must be deferred until a later time and place). First, there is 

^n™?^" °l! ( ?r d 1 f xica ] i «« tio n) of formerly purely-phonological 
processes, which often transfers the conditioning for such a process 

from a single phonological configuration to severai morphemes which 
once had something to do with that configuration. In this way, a 
once-unitary formal operation can become fragmented via its multiple 
direct association with numerous affixes and/or roots (German Umlaut 
being a notorious ca3e in point). Second, there is accretion by 
metanalysis, whereby a reanalysis of morpheme boundaries results in 
f l e 5i! C K 7 e add \ tlon *° 80 existing morpheme of segments which 
formerly belonged to another morpheme (as illustrated by the German 
agentiyes mentioned above, of which Z ngr and ^ler are the result of 
accretions to ^er based on resegmentation of forms like Rechn-er 

calculator, and Betfl-er 'regulator 1 , respectively). 

Third and finally, but perhaps most commonly, M.ere is reanalysis 
of root + affix combinations in such a way that not segments but 
ratnef*semantic and/or morpho(phono) logical properties of a 
particular root or roots are re-signed (or jointly assigned) to the 
attix, which thereby becomes correspondingly fragmented from other 
instances of the formerly identical affix occurring with different 
roots. It is apparently in this way that Sanskrit forms such as the 
aforementioned ajjj-rdh- arose: the reanalysis of words like a:n-aws'- 

as having an invariant initial morpheme aH£ rather than a ' 

reduplicative affix made available a prefix a^ which could then be 
used elsewhere. This mechanism can perhaps be most vividly expressed 
by the following metaphor: when a given affix is deposited in the 
bank of the lexicon along with a particular amount of principal 
contained in a specific root, the account draws semantic and 
aorpho(phono)logical interest mainly on the root or the entire word, 
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but such interest can be taken along when the affix in question is 
withdrawn for use with another root. 

The mechanisms just described as conspiring to produce 
morphological and lexical fragmentation (i.e. rule- constel Jations) 
can be characterized as operating diachronically, but such a 
characterization hy no means absolves linguistic theory of the 
responsibility to account for such phenomena. In fact, it does just 
the opposite, given the usual generativist assumption that language 
change is governed primarily (if not exclusively) by constraints of 
synchronic grammar. Actually, then, the evidence presented heiein 
regarding speakers' preference for fragmented— that is, 
morpholexically particularized, rule-constellational — analyses of 
reduplication and morphological processes in general requires that 
grammars he constructed so as to pioce a premium on morpholexical 
solutions to linguistic problems. Generative grammar, 
having — correctly, it seems — made a diachronic hed governed primarily 
by synchronic principles, is here forced to lie in it: the historical 
frequency nnd ubiquity of morpholexical fragmentation leave one 
little choice but to build not just a place but even a preference for 
.such rule-"mitosis" and the resulting rule-constellations into 
(synchronic) grammatical theoiy. Moreover, the explanatory potent ial« 
of such an approach is extraordinarily great, for it promises to 
cover not only the fragmentation of morphological rules and its 
associated morphologization of phonological rules, hut also 
"downgrading", the morphologization of syntactic phenomena brought 
about hy such interacting processes as semantic bleaching, 
cliticization, and clitic-to-affix conversion. Still, limits of space 
preclude a fuller discussion of sucli issues at this point, so a 
number of observations are presented in conclusion regarding general 
lessons for the elaboration of morphological theory that emerge from 
this particular study of fragmented reduplication in Sanskrit. 

For one thing, there can he no substitute for f ine-gi ained studies 
of particular instances of a given phenomenon (e.g. reduplication) in 
a single language (e.g. Sanskrit) as the prime source for revealing 
insights into the nature of such a crosslinguistically common 
morphological process (again, as reduplication). Studies which 
3nperficnlly draw only selected "representative" data from a wivle 
range of languages, ignoring exceptional forms as uninteresting and 
focussing on elegantly-describable forms, simply will not do. They 
proceed o little like the drunk who dropped his keys in the dark just 
outside the tavern-door but went looking for them up the street under 
the lamppost because the light was better there. Actually, they are 
even less defensible than this, methodologically, because they tend 
not to take a fair look even for what they seek, hut rather to start 
out with an artificially limited preconception of what they will 
consider as relevant data. It is not surprising then that 
reduplication looks crossl inguistically elegant under an 
autosegmental analysis if in fact the primary data going into such an 
analysis have been selectively gathered from languages so as to favor 
straightforward reduplication rules while passing over exceptional 
and complicated iiil«s of this type. 21 
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Next, a related issue, it must be concluded that the attempt to 
exclude sucli except ionnl and complicated forms, trow consideration on 
the grounds that they are somehow "marked" is not only circular but 
also completely uudeicut by the fact Hint fragmented reduplic.it ion 
mles (and rule-constellations in general) are staggeringly eonion 
both across and within languages. When the situution of having many 
allegedly marked phenomena (like fragmented reduplic.it ion; is the 
unmarked case in languages, then markedness itself js probably best 
excluded from consideration as an explanatoiy factor in attempts to 
account for such fragmentation, or at, least needs to be set aside for 
urgent ree va 1 ua t i on . 

renultimately, the essence of morpholexical part lculorization of 
rules as an activity that is lexico semantics 11 y driven should bring 
to mind that the study of formal aspects of woi d format ion does not 
exhaust the subject -matter of moi phology. Rather, even what appear 
to be purely formal charactei istics of word formation, such as 
fragmented reduplication in Sanskrit and many othei languages, may 
often turn out to have some lexu.o semantic motivation. One should 
keep in mind here Jnkobson's dictum that language without meaning is 
meaningless. 

Finally, the entire phenomenon of morpholexieul rule -frugmentat ion 
and rule-constellations bears directly on issues raised by the 
frequently heard saying (apparently due originally to Meillet 
(1903-1904, p. 641) that "le langage forme un systeme ... ot tout se 
tient" ('language forms a system where everything holds together'). 
Whether implicitly or explicitly, this claim is surprisingly often 
taken to mean that all aspects of linguistic structure aie equally 
directly and equally closely linked to one anothei , and this 
interpretation then results in a pi inciple that those linguistic 
analyses are always to be preferred which yield a maximum of 
structuial homogeneity and intei connectedness. While laudable in the 
abstract, such a viw tends ultimately to have a Procrustean effect, 
since it entourages the brute foice honing out of recalcitrant 
details within one domain of linguistic analysis on the basts, not of 
internal considerations, but of fit with other areas of grammar. 
Moreover, hard evidence against sach a practice is available from 
such research as Ferguson's work on simplified registers like 
foreigner- talk, which in certain ciucinl respects appear to be 
independent of the rest of grammur. Ac Ferguson 0981, p. 3) boldly 
put it: "Tout does not se tenir", i.e. though everything ni language 
obviously holds togethei with something, it is not the case that 
everything holds together with everything. 

Sucl* a conclusion could hardly be moiu strongly supported by the 
facts tf fragmented Sanskrit reduplication and of cross linguistic 
morpholexical particulnrization in general. As such phenomena 
evolve, they involve gradual but steady foimal and functional 
development of morphological and lexical rules away from each other, 
with no apparent regard for anything else in graimnai except the 
expression of lexical at id lexical class semantics fand of 
grammatical-morphemic notions). In fnct, the historical linguistic 
literature is replete with similar instances where locally motivuted 
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changes in grammar led to complexity (and often subsequent change) 
elsewhere. Given the difficulty which even full-time professional 
linguists have in keeping the entire grammar of a single language in 
wind at once, is it actually at all surprising that naive speakers 
behave as if they are unable to do this? Rather, it seems that their 
conscious and unconscious dealings with language are severely 
constrained in scope by a highly United window determining how Much 
grammatical structure they can consider at one tine. At the very 
least, positing such a limitation seems the appropriate step to take 
in accounting for fragmentation of reduplication and other 
morphological processes in Sanskrit and elaewhere. In this way also, 
such notions as Wittgenstein's family-resemblance and Rosch's 
prototypes, especially as they have already been brought into 
linguistics l>y other scholars, suggest themselves as having much 
explanatory promise. 

An approach like this has a bright future, then, but it also lias 
an estimable past. Although the phenomena of Sanskrit fragmented 
reduplication led us to the concept and name "rule-constellation" 
prior to our encountering relevant work by Louis Hjelmslev, that 
linguistic pioneer turns out to have anticipated such a notion in a 
general way nearly half a century ago, and by way of conclusion, his 
words on this overall topic are given (in our own translation, from 
the French of Hjelmslev 1939/1959, p. 114): "The famous maxim 
according to which tout se tient dans le syst&ae d'une langue 
{'everything holds together in the system of a .language'] has 
frequently been applied in too rigid, too mechanical, and too 
absolute a fashion. One must keep natters in proper proportion. It 
is important to recognize that everything holds together, hut that 
everything does not hold together to the same extent, and that 
alongside interdependences, there are also purely unilateral 
dependencies, as well as pure constellations.' 1 It is our hope that, 
in the present study of Sanskrit reduplication as fragmented 
affixation and of its broader implications, we have given such ideas 
as those just quoted a concrete enough form so that they not only can 
receive a principled answer from Radio Yerevan but will also find 
practical application in the morphological and general-linguistic 
investigations of other scholars. 
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Alan Hudson-Bdwards, Ann Miller, Joel Nevis, and Arnold Zwicky for 
other kinds of help in the preparation of this work. 
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2. Stemberger 1980 has a review of recent generative literature 
on this topic; Borowsky & Mester 1983 is the most recent proposal 
(evaluated in Joseph & Janda (in preparation)). 

3. We do, however, refer on occasion to facts about nominal 
reduplication. We exclude from consideration here more sporadic 
types of reduplication such as the a:mredita compounds, consisting of 
a repetition of a word, with Joss of accent in the second member, for 
an "intensive, or a distributive, or a repetitional meaning" (Whitney 
section 1260), e.g. vayam-vayam 'our very selves' (cf. yayam 'we'). 
Still, clearly any comprehensive treatment of the variety of 
reduplication rules in Sanskrit would have to take such types into 
consideration as well. 

4. We are admittedly being somewhat eclectic in our choice of 
examples, taking them from all chronological stages of Sanskrit (e.g. 
Vedic as well as Classical Sanskrit). Our primary source is Whitney 
(1885, 1889), two of the classic Western grammatical studies of 
Sanskrit. In part our eclecticism stems from our belief that 
knowledge of the older language persisted into at least the 
beginnings of the Classical period (e.g. Vedic forms are noted in 
Pa:nini's grammar) and so was an area of at least passive competence 
for many speakers. Also, many patterns we present here as 
illustrating a certain type can be found in a variety of stages of 
the language, even if a particular example may be restricted to one 
period. We have not in general indicated the age of any given form, 
except where such information is important. 

5. The identification of this form is complicated a bit by the 
existence of another form of the root, as^, the desiderative of which 
follows the formation with nonprefixal reduplication discussed in (6). 

6. These last three forms are from the Vedas and Brarhmanas only; 
there appear to be no VC- nonperfects to be found in Classical period. 

7. It may be, though, that the lack of aspiration on the d in 
bad-badh- is the result not of prespecification of (-aspirated] for 
the entire reduplication syllable, but instead of the independent 
workings of Bartholomae's Law (giving an intermediate stage 
/badbhadh-/ from underlying /badh-badh-/) and Grassmann's Law (giving 
the attested form). 

8. This pattern is found in later stages of Sanskrit as well, but 
by Classical times, the predominant (-aspirated, back] 
prespecification prevails, as in Classical bari: -bhr- (intensive stem 
of bhr- 'bear', cf. (4b)) and jari: -hr- (intensive stem of hr- 

*take , cf. (4a, c)). Note also that'we are here purposely~excluding 
forms such as the desiderative dhitsa- from dha:- 'put' (i.e. 
/dhi-dh-sa-/) which show aspiration probably as a result of analogy 
and which are synchronically formed— despite their diachronic origin 
in reduplication —via an internal change process limited to (a 
subtype of) desideratives— see Sag 1976 and Schindler 1976 for some 
discussion of these desideratives as well as other such forms with 
aspiration in the apparent reduplication syllable. However, bringing 
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in such forms could only bolster our claims about the lack of 
uniformity in consonantal pre spec if ication. 

9 That, this is not simply a dvandva (copulative) or a:mredita 
(distributional) compound hut instead a true adverbial derivation via 
reduplication is shown by the fact that there is a fixed pattern of 
vocal ism for the unit— the first element always ends in -a:- and the 
second element in ^i* See footnote 3 above, thong]), concerning the 
compound reduplications. 

10. The form with the vocalism -i'- is found in the rather late 
Vedic Aritareya Arranyaka, with -i- found both earlier and later on. 

11. If surface e/o in Sanskrit are to he treated as underlying 
diphthongs (i.e. respect Holy, as ay/ aw , with consonantal y_/w), then 
(5c, d) may not really illustrate differential vocalism. 

12. When the aorist stems are used to form a true past tense, an 
inflectional prefix a- is added outside of the reduplication 
syllable, e.g. a- ti: -tap-at 'she heated' (aorist of tap- ). Similarly, 
lexical prefixes can he added outside of the reduplication syllahle. 

12a. George Cardona (personal communication — 1/25/86) has 
suggested that, following Pa:nini, one might analyze the 
edidhisa type of reduplicated stem as involving rightward iteration 
(copying) of the second syllable of the root-plus-desiderative suffix 
sequence (i.e. edhisa- — • > edhidhisa- — •■> edidhisa- ) rather than 
the infixation into the root itself of a reduplication syllable, as 
we suggest here. To a certain extent, the analyses are not really 
very different, for in both of them, the reduplication syllahle is 
infixed— in our analysis, it is infixed in the root, whilcin the 
other* it is infixed within the stem formed by the addition of tho 
desiderative suffix. Nonetheless, we believe that a Pa:ninian-style 
analysis is to he rejected, for two reasons. First, the intermediate 
stage /edhidhisa**/, as is evident above, must become (edidhisa-] by a 
deaspi ration process. This deaspi ration seems in all respects to he 
similar to the deaspi rat ion (the remnant of (irassmann's Law) that is 
regular in reduplication when two aspirated consonants come to occur 
in successive syllables (see above in section 2 and the forms in 
(3)). However, the edidhisa-type deaspiration must he triggered hy a 
nonroot segment, since the second aspirate is the copied 
(reduplicated) element, whereas normally (with the exception of the 
clear relic forms jshi 'strike! (2SG.IMPV of han- ), bodhi 'become!' 
(2SG.JMPV of hhu:-), and vidntha 'distribution' (derived from dim:- 
'distribute' )— see Schindler (1976: 626)), enly segments that are 
part of the root trigger the deaspiration. Tims the Pa:niuian 
analysis requires a complication in the statement of the deaspiration 
process. Second, not only is there deaspiration (as regularly in 
reduplicated syllables, in our analysis) in the edidh:so -type of 
formation, hut there is nlso pnlatalization of hack consonants, as 
shown by arjihiyn- , desiderative stem of arh- 'deserve'; as noted in 
section 2 (and see the forms in (3)), such pnlatal izntion also is 
regular with segments in reduplicated syllables. In t)u* Pa'nininn 
nnnlysis, there is no reason to expect an intermediate stage 
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/arhi-hi -sa- / to develop into the attested arjih isn , since Jiij- is a 
permissible sequence phonotactically , and the leftmost .hi_ is part 
of the root not part of the reduplicated syllable. However, under 
our analysis, the j.i in arjihisa is the reduplication syllable, so 
that the palatalization is expected. Thus, our root infixing 
analysis presents no complications in the phono logical processes 
associated with reduplication. 

13. Admittedly, it is quite difficult to know exactly what the 
expected forma should be, since vowel-initial roots present a rather 
mixed assortment of reduplications of all types, even at the earliest 
stages; the pattern of edidhisa - is a likely candidate to have been 
the model since it is attested somewhat early (in the late Vcdic 

Va: jasaneyi-Samhita: ) and since a plausible path of development can 
be inferred for it. Indeed, the limited spread of the ejdidhi^a.- type 
suggests that this provided for speakers a relatively satisfactory 
solution to the problems posed by thrse vowel' initial roots. 

14. We omit here one form, the Vedic hapax legomenon j£guri- 
' steep (?), leading (?)', not only because of its obscurity, but 
because of disagreement as to its etymology. Only if it is connected 
with jr-;- 'waste away', as Whitney (1885: 55) believes, does it show 
reversion. Mayrhofer (1956), however, more plausibly connects it 
with gur^ 'lift up', citing phonological prohlems with the jr- 
etymology (since jr- is from Indo-European tg'er-, and the palatal 
*g' should never yield a Sanskrit (gj). We arc also excluding 
sporadic instances of reversion not connected to reduplication, such 
as Rig Vedic 2P1. middle aorist a_-_srg-ran from srj^ *scnd forth'. 

15. The native grammarians divide these forms into two roots, 

lei I and cit_. despite thc.ii etymological identity, so that under 

such an analysis, there is no reversion to speak of, but only 
parallel formations from parallel roots. 

15a. As George Car don a has kindly pointed out to us, this form, 
despite Whitney's (1885: 205) clasification of it as connected to 
hi! 'impel', may in fact represent a different root. 

IK. If the difficult epic apparent intensive 2SG imperative 
j i | jah i is a foim of htm_, then this would be an example from this 
root without the reversion. 

16a. It is important to point out, as Wolfgang Dressier has 
kindly reminded us, that the occurrence of T. ..ST- in the 
reduplications of the (8a) type is not the result of a general 
phonotnetic constraint operative in the language prohibiting 
sequences of ST... ST in successive syllables oi within the same 
word. Fotms such as astos ja 'is)he praised' (3S(J s-aorist of stu- 
* pra i s e ' ) and s t ha : s isja ' ( s ) he s t ood * ( 'A SG s i£-aor i s t of st ha, :. - 
'stand*) show that the situation found in these reduplications is not 
a matter of Sanskrit plionotact ics, and thus the pattern illustrated 
in (8a) is probative for demonstrating the clustering of the various 
reduplication rules. 
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17. Thus, there are many non reduplicated present stems (ten 
classes in all, with reduplicated stens Making up only one class, 
representing 4% of the total — to go by Whitney's statistics) and 
aorist stems (seven formations in all, of which reduplicated stejas 
make up only one type, representing 21* of the total — again based on 
Whitney's statistics), a subclass of desideratives without 
reduplication (see footnote 8 above), and even a few perfect stems 
with no reduplication (e.g. ved- from vid- 'know'). 

17a. It might be thought that the generalization illustrated by 
the forms in (8a) is a static truth about the language but not 
necessarily one that shows that speakers actually made the connection 
among the various rules in the way we suggest they did. However, two 
of the sibilants that participate in the (8a)-type reduplication aie 
sounds that developed within the history of Indo-Iranian and/or Indie 
(the palatal s being the Indo-Iranian and Indie outcome of the 
Proto-Indo-European palatal stop #k' and the retroflex s. being a 
specifically Indie development of *s in a variety of environments). 
Thus the fact that all three sibilants behave alike with respect to 
reduplication of #ST- roots shows that speakers at some point in the 
history of Sanskrit made the generalization across reduplication 
tules that we are claiming, thus providing a degree of unity for the 
various rules in question. 

18. It is, moreover, the view that is implicit in most of the 
traditional grammatical studies of Sanskrit, e.g. Whitney 1889. 

19. This is the native grammarians' seventh class. "Strong" 
versus "weak" foims of the stem are diutributed according to 
morphological category, e.g. singular versus plural, active versus 
middle, etc. The underlying /n/ of the infix changes to (n), [ft) , 
etc. in predictable phonological environments. 

20. As a reduplicated present, ja; grmi would be unusual in having 
a weak grade second syllable, and the absence of any intensive 
meaning is noteworthy. 

21. In fact, another detailed study of Sanskrit reduplication and 
related topics has recently appeared (Steriade 1985), although we 
gained access to it too late to permit further accouut to be taken of 
it in this paper. Still, it is significant that Steriade 1 s paper, 
while involving a diffeient focus and approach from ours, completely 
supports the idea that Sanskrit has far more than just one 
reduplication iule, and so provides independent motivation for our 
proposed concept of "rule-constellation". Steriade employs a 
Marantzian autosegnental approach to reduplication in Sanskrit, and 
this gives us a chance to add here one final remark on this general 
type of analysis. We firmly believe that the essentials of the 
autosegmental view of phonology and morphology have much to 
contribute to the analysis of reduplication both in Sanskrit and 
crosslinguistically, and our analysis in this paper is couched mainly 
in autosegmental terms. Nevertheless, we are not convinced of the 
necessity of adopting the Marantzian variation on this theme whereby 

t lie entire phonemic melody of a root or stem is copied over a 




ERIC 



105 



reduplicative template, regardless of how much of that melody 
actually appears on the surface. For arguments in favor of 
alternative approaches involving more limited copying (albeit with, 
freer copying power), see Janda 1984, Hoeksema & Janda 1985. 
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Fragmentation of Strong Verb Ablaut in Old English* 



Keith Johnson 



1. Introduction 

It has recently been claimed (Janda & Joseph, 1986; Bybee, 1986) 
that the normal line of development for morphological processes of 
language is from simple to complex, from unitary processes to 
fragmented ones. This claim goes completely counter to the standard 
generative approach to language change in which change is viewed as a 
type of simplification (Kiparsky, 1968). Janda fc Joseph cite several 
examples of fragmentation (among them are Sanskrit reduplication, the 
English adjectival suffixes -al, -ar, and -able, cud German 
umlaut) which lead them to the conclusion that 'morpho lexical 
fragmentation - of reduplication, of nonreduplicative affixation, and 
of morphological processes in general - is indeed the crosslinguistic 
norm, both as a synchronic state and as a diachronic change' (Janda & 
Joseph, p. 113). In their view and in the view expressed by Bybee 
(1986) fragmentation as a diachronic development is the normal, 
natural line of development for morphological processes. 

In this paper I demonstrate another case of morphological 
fragmentation - strong verb ablaut in Old English (OE). The 
conclusion of the paper is that the process of ablaut which was 
inherited by OE from Indo-European (IE) was fragmented as the result 
of sound changes and that this fragmentation resulted in 
particularization of ablaut patterns to smaller and smaller sets of 
lexical items. One of the theoretical implications of this- account is 
that sound change seems to normally work by changing lexical 
representations, not by adding, deleting or simplifying phonological 
rules. Also, the facts reported here seem to indicate that the 
synchronic result of fragmentation (in this case) is a pattern of 
relations within the lexicon not a set of rules (i.e. not a 
rule-constellation*). Finally, the study indicates that 
speakers/hearers focus on the signifying functions of specific lexical 
items rather than general structural patterns (Bybee* s (1986) 
independent development vs. structural coherence) . 

2. Strong Verb Ablaut in OE 

Verbs in Germanic can be classed as weak or strong according to 
whether their past tense is characterized by an alternation in the 
stem vowel. Traditionally, the strong verbs are divided into seven 
classes according to the structure of their stems. The number and 
type of post-vocalic segments in the Germanic stem determined the 
ablaut pattern of the stem vowel. Typical examples of the strong 
verbs of classes I - VI are listed in table 1. 

Sound changes have obscured the pattern of the post-root 
segments. The first two classes had the vowels i and u, 
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respectively, after the root vowel. Class III was characterized by a 
post-root sequence of a resonant followed by an obstruent. Class IV 
had a single resonant following the root vowel, while class V had a 
single obstruent in this position. 



Table 1 

The 0E strong verb, classes I - VI 
from Lass and Anderson (1975, p. 25) 



Class 


Pre*. 


Pret. Sg. 


Pret. PI. 


Part. 




I 


bidan 


bad 


bidon 


-biden 


'wait 1 


II 


beodan 


bead 


budon 


-boden 


'command 1 


IlKa) 


helpan 


healp 


hulpon 


-hoi pen 


'help 1 


(b) 


weorpan 


wearp 


wurpon 


-worpen 


'throw* 


(c) 


bindan 


band 


bun don 


-bunden 


'bind 1 


IV 


beran 


b*r 


bJtron 


-boren 


'bear' 


V 


sprecan 


spr<c 


spriEcon 


-sprecen 


'speak' 


VI 


bacan 


boc 


bffcon 


-bacen 


'bake' 



2.1 Origin of the ablaut patterns 

In this section I will discuss the IE origins of the 0E ablaut 
patterns. I will show that the patterns illustrated in table 1 are 
not derived historically from the same IE pattern, but constitute a 
mixture of several types of IE verb forms. This account is taken 
primarily from Prokosch (1939) and Campbell (1959). 

The present tense of the Gmc. strong verbs of classes I - V was 
characterized by the vowel e. This vocal ism comes down to 0E 
unchanged in the class IV and V forms beran and sprecan. The 
class I sequence ei became T in Gmc. The class II sequence 
euvas changed in 0E to eo. Two regular sound changes 
affect the vowel in class III. Breaking produces the diphthong eo 
before r in weorpan> and raising before nasals resulted in 
i in bindan. This latter change also affected some verbs in 
class IV (eg. niman 'to take'). 

The preterite singular in classes I - V is a continu ion of IE 
perfect (cf. GrkAfi*" *Xf\t,n* ) 3 and has the IE ablaut vowel to > 
Gnc. a. The ablaut forms of the first five classes are thus: (I) 
*ai > a (Campbell, 1959, 134); (II) au > ea [*>:a] 
(Campbell, 1959, j 135); (III) ta >z (Campbell, j 131) except before 
nasals, then TilC > eaLC ((*a]) (CampbeJ 1, j 143-144); (IV, V) 
*a >* (Campbell, ^ 131) . 

The preterite plural 4 in the first three classes of strong verbs 
had reduced grade (hence IE i t u t R> Gmc i,u,uff) t while classes 
IV and V have Gmc from IE e. in 0E this developed as * 
except before nasals where it became o. The 2s g ending in 
W.Gnc. was -iz > OE This ending is probably from an IB 
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aorist of the type of Grk. f-A«r-€> . since some IB aorist formations 
were also characterized by reduced grade ablaut it is possible that 
the OB pret. 2sg. is the later development of an IB aorist rather than 
an IB perfect. An IB aorist origin is also likely for the pret. pi. 
endings* of which Prokosch (1939) says, 'the Gmc. 3pl. in -ua 
cannot be perfect endings, and the lpl. in -us* need not be one; 
the 2pl. in -u} is surely analogical 1 (p. 163). (OE generalized 
the 3pl. ending to all pi. fores.) Thus, it is possible that the 
reduced grade of the vowel in the pret. pi. of classes I - III 
reflects an IB aorist rather than an IB perfect. These considerations 
led Prokosh to attribute the pret. pi. of classes IV and V in e 
to an IB aorist which he also sees in Grk. $(Zt\v and Lat. vcn- 
and scd- (j 57). Because these Grk. and Lat. forms have 
other, more probable origins it is perhaps better to consider the 
e form as a later development and peculiar to Gmc. » although it 
must have been present at a very early stage of development (all of 
the Gmc. languages show a reflex of it in class IV & V verbs). 

The Michels-Streitburg theory 6 of the origin of a long vowel in 
classes IV and V is dismissed by Campbell (1959) - 'This theory 
involves many difficulties (especially with regard to the 
simplification of consonant groups) and is better abandoned' (p. 305, 
n. 1). Another objection to this theory is that it provides no 
principled explanation for the preservation of reduplication in 
classes IV and V while the reduplicated syllables in the other classes 
were simply haplologized. 

Lass and Anderson (1975) proposed that the long vowels result 
from a grammatically conditioned rule such as (1) which 

(1) Pret. pi. lengthening: from Lass and Anderson (1975) 

V — >C[-long) 

< [+long)/ CV 

/ Pret2 

lengthens pret. pi. vowels when they are followed by only one 
consonant (i.e. classes IV and V in Gmc). This requires that 
underlying representations be quite abstract. The arguments below in 
favor of surfacy representations also constitute arguments against 
this view of the origin of the long vowel in classes IV and V pret. 
pi. forms. 

Wherever the pret. pi. in classes IV and V originated these 
considerations indicate that 'the Gmc. strong preterite is not a 
homogeneous development from one source, but a combination of several 
types' (Prokosch} 1939, p. 164). This is one indication that the 
strong verbs in OB started out fragmented to some degree. As will be 
indicated below classes VI and VII present further complications. 

The passive participle ^ in the first three classes had the same 
form as the pret. pi. in Gmc. In classes IV and V instead of being 
completely reduced (as in I - III) the root vowel was 'schwa 
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secundum' . In Gmc. this IE sound became u before resonant 
consonants and e before obstruents. Thus, pass. part, forms had 
u in class IV and e in class V. The u which was present 
in the pass. part, stems of classes II, III, and IV was changed to 
o presumably due to the influence of the vowel in the pass. part. 
suffix. 8 The sound change was that u > o before mid and low 
vowels. 'There are, however, many exceptions in OB' (Campbell, p. 
43). The fact that there are very few, if any, exceptions among the 
pass. part, forms of strong verbs will be discussed below. 

For the purposes of this paper it is enough to quote Campbell 
concerning classes VI and VII. 'The verbs of Classes VI and VII are 
of a different type' (p. 305). The IB origins of the forms for these 
classes are not very clear at all (see Prokosch, $^60 - 62 for a 



Table 2 

Origin of the ablaut patterns: summary. 



Classes: 
I 

II 
III 



IV 
V 

VI 

Endings 

Sg, 1 
2 
3 

Pi. 

part. 

prefix 



pres. 

*e-i>I 

*e-u>eo 

*e>e/jO 
>eo/ rO 
>i/_N0 

*e>e 
>i/_N 

*e>e 

*a 



*-om>-*>-e 

*-isi>-st 

*-i£i>-t 

*-ari^i>-a 



pret. sg. 

*o-i>ai>a 

*o-u>au>ea 

*o>a>ea/_L0 

>a/_N0 

*o>a> '<£ 
>o/Ji l 

*o>a>£ 

*o 

*-a>-*J 
*-e>-fi 



pret. pi. 

*^-i>i 

**f-u>u 

**SL>L>uL 

*fSN>N>uN 

*e>& 
>o/_N 

*e>* 

*o 

*-es>~iz>-e 
*-un >-un>-on 



pass. part. 

*^-i>i 

*^-u>u>o 

**JL>L>uL>oL 

**W>N>uN 

* A R>uR>oR 



*-ono>-on> 
-an>-<n>-en 
ge- 



by analogy. 



0 = obstruent consonants 
L = liquid consonants 
N = nasal consonants 
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discussion of the posibilities) . Class VI verbs have two likely IE 
sources (1) thematic presents with suffix accent (also called 'aorist 
presents') and (2) 'weak presents' with a j-suffix as is common with 
weak verbs in OB. Class VII includes some vestiges of IE 
reduplication along with a number of verbs of uncertain etymology. 
The formal characteristics of the class VI and VII verbs include: (1) 
the pres. stem and the pass. part, normally have the same vowel, while 
the pret. sg. and pi. have the same vowel, (2) class VI has o in 
the pret. sg. and pi., (3) class VII has either e or eo in 
the pret. sg. and pi. (4) the vowel of class VI pres. and pass. part, 
is a for a large number of cases but there is more variety here 
than in the pret. forms, (5) class VII pres. and pass. part, forms are 
most often in H a, ea. 

Table 2 is a summary of this section which illustrates the point 
that OE ablaut was never a unified phenomenon. The Gmc. background of 
the strong verb forms was a mixture of several different patterns from 
IE. So, from the very beginning of OE there was not just one ablaut 
pattern but several. ^ 

2.2 Further fragmentation within the classes 

This section discusses further types of fragmentation which 
occur in the OE strong verbs. In table 3 which is a summary of this 
section 28 different ablaut patterns are listed. 

Contract verbs in class I formed an alternate ablaut pattern to 
the usual one. When Gmc. [X] occurred intervocalically it was deleted 
in OE. Thus, from the Gmc. present tense form twrihan 'cover' 
(h=(pf]) OE has wreon as the result of regular sound changes: 
This resulted in the ablaut pattern: eo a i i. Other class I 
verbs which have this pattern are: leon 'grant' , *peon 
'thrive', seon 'sieve', and teon 'accuse'. 

There is a fairly large set of verbs (13 out of 51 basic class 
II verbs in Borden, 1982) in class II which show a present in u 
instead of the expected eo. These verbs probably originally had 
a reduced grade present stem (Prokosch, J58,b calls these aorist 
presents). Campbell suggests that the lengthening of u may have 
been by analogy with class I 'verbs with ai in the past had 
I in the present system, those with au in the past might 
develop u in the present system' ( J* 736, b). At any rate, 
another ablaut pattern for class II verbs exists. 

Class III is already represented by three different ablaut 
patterns according to the resonant following the stem vowel. To 
complicate matters a little more we also note that there are some 
aorist presents which are normally included in this class also 
(spurna/it 'spurn', speam, spvrnon, spornen). Also, there is 
a set of verbs which were originally class V (with one post-root 
obstruent) but 'their presents were extended by the addition of a 
dental element, e.g. OE streg-d-an strew, feoh-t-an fight, 
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Table 3 

Ablaut patterns in OH strong verbs, 
from Levin (1964). 



I 


la 


i 


a 


i 


i 


bidan 


bad 


bidon 


biden 


'await* 


I 


b 


eo 


a 


i 


i 


wreon 


wrah 


wrigon 


wrigen 


■ cover* 


II 


2a 


eo 


_ 
ea 


u 


o 


beodan 


bead 


budon 


boden 


'command* 


II 


b 


u 


ea 


u 


o 


brucan 


breac 


brucon 


brocen 


'use* 


III 


3a 


i 


a 


u 


u 


bindan 


band 


bundon 


bunden 


'bind* 


III 


b 


e 


ea 


u 


o 


he 1 pan 


healp 


hulpon 


hoi pen 


•help' 


III 


c 


eo 


ea 


u 


o 


weorpan 


wearp 


wurpon 


worpen 


•throw* 


III 


d 


u 


ea 


u 


o 


spurnan 


spearn 


spurnon 




' spurn' 


III 


e 


e 


X. 


u 


o 


stregdan 


str«gd 


strugdon 


strogden 


'strew* 


IV 


4a 


e 




£ 


o 


beran 


b*w 


b*"ron 


boren 


'bear* 


V 


b 


e 


* 




e 


aie tan 


nr*t 


id it on 


me ten 


'measure* 


V 


c 


"So 


ea 


£ 


e 


seon 


seah 


sJcgon 


segen 


'see' 


V 


d 


i 






e 


t iddan 


bxd 


b*don 


beden 


'pray' 


VT 


5a 


a 


0 


0 


a 


faran 


for 


foron 


faren 


'go' 


VI 


b 


ea 


0 


5 


a 


slean 


slffg 


s logon 


slagen 


'strike' 


VI 


c 


e 




0 


a 


hebban 


hof 


h6"fon 


hafen 


'raise' 


VI 


d 


ie 


0 


5 


ea 


scieppan 


scop 


scopon 


sceapen 


'create' 


VI 


e 


A- 


6* 


0 




st ppan 


stop 


stffpon 


st*pen 


'step' 


IV 


f 


U 


o" 


5 


u 


ciunan 


com 


coiion 


cunen 


'come' 


IV 


g 


i 


0 


0 


u 


ninan 


non 


ndmon 


nuaen 


'take' 


VII 


6a 


a 


e 


e 


a 


hatan 


het 


he ton 


hat en 


'call' 


VII 


b 




e 


€ 




l*ten 


let 


iSton 


l*ten 


•let' 


VII 


c 


5 


e 


e 


a 


fon 


feng 


fShgon 


fangen 


'seize' 


VII 


7a 


a 


eo 


eo 


a 


bannan 


beonn 


bSonnon 


bannen 


' summon ' 


VII 


b 


ea 


5o 


eo 


ea 


fealdan 


feold 


feoldon 


fealden 


'fold' 


VII 


c 


3 


eo 


So 


a 


b law an 


bleow 


blSbwon 


blawen 


'blow' 


VII 


d 


£a 


So 


eo 


ea 


beat an 


bffot 


bSbton 


beaten 


•beat' 


VII 


e 


o 


So 


So 


Z 


blotan 


blSot 


blSoton 


bloten 


•sacrifice* 



Traditional class identifications are given in the first column, followed by 
the reclassification suggested by Levin based on the preterite vocalism. 
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frig-n-an ask' (Campbell, 5*736,b). Thus, new patterns are added 
for stregdan (e,3i,u,o), and frignan (i,^u,o)* 

The pattern i,o,o,u for the class IV verb niman 
'take' can be derived from Table 2 (although not specifically 
discussed in the previous section). The aorist present verb cuman 
'come' aas almost the same pattern: u,6,6,u. Thus, two more 
ablaut patterns must be added to those already identified for class IV. 

There are seme contract verbs in class V which result in a 
different ablaut pattern (seon, see, seah, s*gon, segen or 
in Angl. seon, seah, sawon, seweri) . There is also a set of 
weak presents which have present forms resembling class I weak verbs. 
Otherwise they have ablaut forms like normal class V verbs 
(blddan, 'ask, "pray', bid, bxdon, beden) . 

These same two types of variation in ablaut are found in class 
VI verbs. Contract verbs such as slean, 'slay' have a pattern 
different from the normal class VI ablaut pattern (slean, slay, 
slog, slogon, slagen). There are also weak presents of three 
different types — hebban, 'raise' has the paUern: e,o,o,a; 
scieppan, 'create' has the pattern: ie,o*o,a\ and 
stxppan 'step' has the pattern: at, 6,o,Z. 1 

Class VII verbs are divided into two basic categories according 
to the vocalism of the pret. forms (e and eo). There are 
three different patterns with pret. forms in e, and five 
different patterns with pret. in eo. 

Table 3 is a collection of the different ablaut patterns which 
have been discussed in this section and the previous one. Levin 
(1964) chose to class the verbs by their pret. forma and so the 
traditional classes are spread out to some degree among Levin's 
classes. The degree to which speakers of OE made such identifications 
is unknown. The issue will be addressed briefly in section 4 below. 

3. Lexicographical evidence concerning the strong verbs 

In a survey of the strong verbs listed in Borden (1982) and 
their corresponding forms in Middle English (Stratmann and Bradley, 
1891)12 J f oun( j that they were productively used in word formation 
processes in OE and that they survived into MB at uniformly high rates. 

Table 4 summarizes the results of this survey. The first row 
indicates the total number of verbs in each class listed in 
Borden(1982). The second row shows the number of these verbs which 
are not the result of derivational processes (ie. helpan 'help' 
vs. ahelpan 'support'). The percentage of the total which are 
underived forms (third row) indicates that although their absolute 
values are quite different all of the classes have roughly the same 
ratio of basic forms to derivations. These percentages indicate that 
there are approximately four derivations for every basic form in the 
strong verbs. This is evidence for a certain degree of productivity. 
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Table 4 



Lexicographic evidence concerning OB strong verbs. 





I 


II 


III 


IV 


V 


VI 


VII 


explanation 


1. 


283 


270 


423 


100 


171 


192 


290 


Tot. # of verbs 


2. 


61 


52 


85 


15 


30 


27 


55 


# of basic forms 


3. 


22* 


19* 


20* 


15* 


18* 


14* 


19* 


* basic forms of tot. 


4. 


8 


14 


12 


4 


8 


7 


17 


# of anom. basic fas 


5. 


13* 


27* 


14* 


27* 


27* 


26* 


31* 


* anom. of basic fms 


6. 


42 


40 


60 


11 


22 


21 


42 


# of basic fms in ME 


7. 


69* 


77* 


71* 


73* 


73* 


78* 


76* 


* ME of basic fms 



The fourth row in table 4 is the number of basic forms (as 
listed in the dictionary - ie. present tense form) which do not match 
the forms predicted by table 2. The percentage (row 5) of the total 
number of basic forms (row 2) which are anomalous in this sense 
indicates the degree of fragmentation. With the exception of classes 
I and III the percentage of each class which does not fit the basic 
description of the class is about 28*. This is an indication that the 
fragmentation discussed in section 2.2 is more than just an occasional 
exception to an otherwise overwhelmingly stable pattern. 

Row six is the number of basic forms which came down to ME as 
strong verbs (as found in Stratmann and Bradley, 1891). In all cases 
the percentage of OE basic forms which survive into ME is quite high 
(row 7, X=74*). This indicates that the fragmentation of OE strong 
verbs did not cause speakers to avoid using them. They do not seem to 
be particularly difficult for speakers. 1 ** 

This survey of lexicographical ovidenc indicates then that 
although strong verbs were indeed fragmented to a substantial degree 
they were nevertheless included in word formation processes and were 
not abandoned over the period from OE to ME. 

4 . Evidence agains t abstractness 

If the lexical representations of strong verbs were sufficiently 
abstract then the fragmentation which was demonstrated in section 2 
would indicate nothing more about OE than that there were a number of 
interesting synchronic rules which caused the surface forms of strong 
verbs to have some variety. It is my claim in this paper that 
fragmentation resulted in variation in the lexical representations of 
strong verbs — not just variation in their surface forms. In order 
to establish the claim that fragmentation results in changes in 
lexical representations it is necessary to show that the lexical 
representations involved are not abstract. There is a wealth of 
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evidence in the OE strong verbs which rules out abstract 
representations. Some of this evidence will be presented in this 
section. 

4.1 Analogies 

Lexical representations are the representations in memory of the 
words that speakers/hearers use. They are memories, and like other 
memories they have varying degrees of strength. * 4 When our memory for 
a word is weak we may rely on the pattern of a similar word which is 
■ore clearly remembered. * 5 This is analogy. The point is that 
analogy is from one lexical representation to another in this sense of 
'lexical representation*. 

There are some cases of analogy in the OE strong verbs (to be 
discussed below) in which surface forms such as those listed in table 
3 are the basis for analogy. This indicates that the lexical 
representations for these forms are like their surface forms. 

The analogical lengthening of the aorist present in class II 
(table 3, 2b) verbs in Gmc. has already been mentioned. Crucial to 
this analogy is that the class I present form is / in the 
1 exicon ( ai : !: : au:X) . 

Also, the lengthening of o in pret. sg. in class IV verbs 
with post-vocalic nasals (table 3, 5f & g) has been attributed to 
analogy (in the note to table 2). There are two possible models for 
this analogy. If the class IV pret. pi. is the model this would 
indicate that (1) the sound change e > <vLAfwas a change in 
lexical representations, and (2) that the pret. pi. was a lexical 
representation - not derived form the pres. form. If the class VI 
pret. sg. served as the model for this analogy the lexical status of 
the pret. sg. form is indicated. 

A variant which occurs beside nom pret. sg. of niaan 
(table 3, 5g) is nam. This variant 'is due to analogy of nasal 
verbs of class III' (Campbell, p. 313, n. 1). The formula: 
bindan:band: :niman:X indicates once again that the pret. sg. 
(band) is stored in the lexicon. 

There is a tendency for contract verbs of class I in eo to 
shift to class II. Thus, teon 'accuse' has teah, tugon in 
addition to tab, tigon. About four percent of the class I basic 
forms shifted to class II in this way. This analogy illustrates the 
sufficiency of surface identity (eo, eo) for the occurrence of 
analogy. It also illustrates the structural coherence of the ablaut 
pattern. The pattern is productive in the sense that it can be 
extended to new verbs in the class. Note also that it is the surface 
pattern that is extended. 

Finally, the possibility of analogical forces involved in the 
form of the pass. part, in classes II, III, and IV should be 
mentioned. As was mentioned earlier the sound change lowering u 
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before a following mid or low vowel was somewhat erratic in 
application. (Note the exceptions: ufan 'over', ufor 
'higher', pusa 'bag', sugga 'a kind of fish', and tube 
'trumpet'.) The fact that it was not eratic in the case of the pass, 
part, of strong verbs may be an indication of analogy. The o 
vocalism nay have taken on some grammatical/semantic value that helped 
the sound change take place in all pass. part, forms.* 6 Once again, 
separate lexical representations (and non-abstract ones) are suggested 
by the evidence. 

4.2 Non-application of sound changes 

Breaking did not apply to berstan 'burst 1 or ~perscan 
'thresh' . These words were originally class V verbs formed by the 
addition of an infix with a dental consonant (stregdan was given 
earlier as an examrlo). They have also undergone a metathesis. Thus, 
the Gate, forms were *bres-t- and ^re-sk-. 

The non-application of breaking could be taken quite simply as 
an indication of rule ordering. The derivations would be something 
like (2). 

(2) /bres-t-an/ /ire-sk-an/ 

breaking 

metathesis berstan ^erscan 

Campbell 459,1) has metathesis in this case as rV > Vr/_s 
or n. Exceptions include: hrespan 'to strip, spoil', 
cranio 'record', cranoc 'crane', cristalla 'crystal', 
bresne 'nighty', bracsian 'to do work in brass', brastlian 
'to roar, rustle', restan 'to rest', rendan 'to rend', 
scrind'amft course', strand 'sea-shore' , J>rosa 'smoke' , 
trandende 'precipitous, steep', trendon 'to turn around', 
pines 'Trinity', Jrintan 'to swell', Jryscan 'to wiegh 
down'. , 

The order of the 'rules' in (2) has a phonetically motivated 
'low level' rule followed by a lexically specific 'high level' 
rule. 17 A simpler theory of phonology is possible if we treat 
breaking as a sound change which changed the lexical representations 
of the words to which it applied (ie. /werpon/ > /weor&z**/ 
'throw'). After this sound change had ceased to be active another 
sound change (metathesis) produced lexical representations which could 
have undergone breaking if they had existed when break in? was active. 

Metathesis also affected the class III verb brinnan 'burn' 
which with metathesis had the forms: birnan, barn, burnon, burnen 
(Campbell, ^741 - compare table 3, 3a). In this case we expect 
breaking before r (like weorpan> table 3, 3c) but instead the 
ablaut pattern does not change at all. In addition to the 
non-application of breaking, the preservation of the effects of 
raising before a nasal consonant is interesting. This rule is 
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formalized by Lass & Anderson (1975, p. 29) as (3). 



(3) 



V — > +high / +nesal 

-low 



Thus, bindan and brinnan are derived from bendan and 
brennan respectively. The rule also blocks the application of 
back umlaut in the pass. part, forms L widen and brunnen. The 
stages of development for birnan were thus: 



As in the case of the interaction of breaking and metathesis, the 
historical order of these changes is not the preferred synchronic 
order. If the lexically specific 'higher level' rule (metathesis) is 
ordered before the 'lower level' phonetically motivated rule 
(raising), then metathesis would bleed raising, giving bernan 
instead of birnan* It seems preferable in this case to view 
raising before nasals as a sound change which had run its course 
before metathesis took place. In this view, (3) is not a synchronic 
process, but represents the lexicalized results of a sound change. *-° 
This is more clear evidence against abstract lexical representations. 

4.3 Borrowing a Latin ve rb 

Finally, the borrowing of the Lat. verb scrlbere as a 
class I strong verb scrifan 'decree' is evidence for the non- 
abstract representation of class I verbs. Here, as in several of the 
cases of analogy mentioned earlier, surface identity is enough to 
associate the verb with class I verbs with present stem vowel 
/• * The borrowing also indicates the synchronic salience of 
the ablaut pattern, although the fact that this was the only verb 
borrowed into the strong verb system (while many were borrowed as weak 
verbs) is evidence of the tendency to generalize the weak verb pattern 
at the expense of the strong verb pattern. 

Conclusion 

Thus, the synchronic result of the fragmentation described in 
section 2 existed in the lexical representations of the words 
involved, not in morphological or phonological processes in the 
synchronic grammar of OS. The evidence presented in this paper is 
consistent with a view of language in which the lexicon (as words 
stored in memory) takes a central position. Although there is 
evidence (some of it presented in this paper) for the reality and 
potential for active use of patterns within the lexicon which might be 
described using process notation, the conclusion to which the evidence 
points is that these 'processes' are actually patterns in the 
lexicon. The appearance of behavior which indicates that some sort of 
association has been made (for instance, the analogy of class I verbs 
like wreon with class II verbs like beodari) cannot be 
taken as evidence for the existence of a morphological process. (In 
fact the evidence in this particular case indicates that a pattern 



(4) 



Gmc 

raising 
metathesis 



bren- 
brin- 
birr,- 
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among some lexical items has been extended to some other lexical 
entries.) 

The overall view of language sound systems which is indicated by 
the evidence in this paper is that language is rich in the lexicon and 
poor in the grammar - that most of the information needed to pronounce 
words is available in their representations in memory and that the 
modifications which they undergo between lexical retrieval and 
articulation are quite limited. Along with this view (and also 
indicated by the evidence presented here) is a view of sound change in 
which the lexicon is central. Sound change is seen as primarily 
change in lexical represent ions rather than grammar change. This view 
helps to explain why fragmentation occurs and why it is a common, even 
normal, situation: for each particular sound change some lexical 
representations have the appropriate conditions for change while 
others don't; therefore fragmentation can result from a series of 
sound changes. 



Notes 

*Many thanks to Brian Joseph. He is the one who told me that 
the only thing my paper needed was to be written, and his knowledge 
and eye for linguistic detail has saved me from some embarrassing 
mistakes. Some of my fellow students at Ohio State have also provided 
useful comments and valuable discussion ■■ Brad Getz, Peter Lasersohn, 
Joyce Powers, Jane Smirniotopoulous and Debbie Stollenwerk. Thanks 
also to the Crusaders for musical assistance. 

ijanda and Joseph (1986) found that the synchronic result of 
fragmentation in Sanskrit reduplication was a rule-constellation, i.e. 
'a group of formally similar morphological processes sharing at least 
one characteristic property of form but distinguished by individual 
formal idiosyncrasies which prevent their being collapsed th one 
another* (p. 104). 

2 The point is that speakers/hearers do not feel compelled to 
preserve generalizations in morphological processes. Fragmentation 
can be seen as replacing one generalization with several as the result 
of sound changes which obscure the original generalization. The fact 
that sound changes are not resisted when they threaten a 
generalization. C8n be taken as evidence for one of two accounts of 
sound change. (1) The generalization is not really threatened because 
the lexical representations are abstract enough to preserve the 
general pattern, or (2) speakers/hearers do not need big 
generalizations, because little ones (particularized to sets of 
lexical items) will do just as well. This last view (which will be 
argued for in this paper) entails that lexical items are more 
important semiotically than are morphological processes. 

%o Prokosch $56 'essentially a direct continuation of the IE 
perfect tense.' The singular endings in Gmc. are reconstructed as: 
-a, -tJia, (compare Grk. perfect f onus ct& , o7<A , o7£ . The 
2sg. in W. Gmc. probably comes from a different IB source and will be 
considered in the next section with the pret. pi. forms. 
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4ln W. Gnc. the pret. 2sg. form also has the ablaut vowel which 
appears in the pret. pi. forms. 

5* It is tempting to assume, though not definitely demonstrable, 
that Prim. Gmc. oL > a in the form of West Gmc. from which OB and 
OFris. were derived ... an'* that this a was then subject to 
change in two directions, becoming a before nasal consonants, x 
(or e) elsewhere. Such a double development of a would be 
parallel to the OB and OFris. treatment of a* (Campbell, 129). 
a > 6 is also more phonetically plausible than K > o. 

^According to this theory the long vowel is the result of a 
conflation of two syllables which result from reduplicaton (which is a 
typical marker of perfect forms in IB). 'Thus from the root 
sed,ait t the IB perf. pi. would be fsesed- > tsesd- > tsezd- 
> Gmc. *sZt-> (Campbell, 1959, p. 305, n. 1). 

7' It expresses pure passivity, not necessarily passivity in past 
time' (Campbell, 727). 

&The pass. part, suffix in Gmc. was -on. The three other 
suffixes which occur on stems with u (pret. sg. -iz, pret. pi. 
-urif and subj. pret. pi. -iri)>) all had high vowels in <3mc. If 
the sound change (u > o before a low or mid vowel) occurred before 
these suffix vowels were lowered then the appropriate environment 
existed only for the pass. part. 

^One has to wonder if any stage of any language can be found for 
which there is ngt evidence of fragmentation. Of course there can be 
local unity (ie. groups of lexical items which are treated similarly) 
but the trend seems always to be toward lexical specificity rather 
than morphological generality. From a diachronic perspective it seems 
to be fragmentation all the way down. 

10 ie in the present is the result of the influence of the 
palatal consonant c. 

H<fc.in the present results from a failure of umlaut across the 
cluster pp. The pass. * irt. form may be due to analogy with the 
present. 

12 This dictionary is more concise than the OB dictionary that I 
used and so there may have been a bit higher rate of retention than I 
am reporting here. The concise dictionary was more convenient to use 
and gives a good general idea of retention rate. 

13 Moder(1986) seems to indicate that the vestiges of strong verb 
ablaut patterns are still salient and extendable. 

l^In this view of what a lexical representation is it would be 
surprising if there weren't lexical representations for all of the 
forms in a paradigm. This is because each separate form (as it is 
encountered in language use) is an event to be remembered. That the 
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memories of the forms in a paradigm are related to each other is a 
given. Exactly how this is done is a matter for future research. 
Also, note that this view of the lexicon is similar to Bybee's (1985, 
pp. Ill ff. ) 'dynamic model' . 

evidence presented in Esper (1973, chapter 6) indicates 
also that a pattern of relations among words can lead to the invention 
of new forms. He points out the necessity of treating both the 
analogical creation of new forms and the analogical revision of old 
ones as instances of the same type of process. As regards the 
conception of lexical representation which I am adopting here, I am 
sure that he would reject my 'subjective mental ism' and prefer to talk 
only about those aspects of language that are observable (ch. 7). 

reinterpretation of o-vocalism as a marker of pass. part, 
is an instance of abductive change (Andersen, 1972). 

l7 This order, if kept, would require a substantial revision of 
phonological theory. 

Wit could be salvaged as a synchronic rule by complicating it 
with an optional r between the vowel and the nasax. However, 
counterexamples of this revision can be found from forma 'first' 
to terma 'end'. 

l%his raises the whole issue of speech perception with abstract 
representations. The counter-argument to my claim here is that upon 
hearing Lat. scrfbere OE hearers 'heard' /screib-/ and thus 
identified the verb as class I. The problem with this is that with 
abstract lexical representations OR hearers had to sometimes 'hear' 
[i] as /ei/ and sometimes as /!/. Now, if they had a paradigm for 
scrfbere in which there was evidence for a stem like /scr_ib-/ 
it might be possible to claim that they 'heard' /ei/ in the present 
tense. In the absence of such evidence we must assume that they 
'heard' /!/ and thus that their lexical representations for class I 
verbs also had /!/. 



References 

Andersen, H. 1972. Abductive and deductive change. Lang . 40:765-793. 

Borden, A.R. 1982. A Comprehensive Old-English Dictionary . 
Washington: University Press of America. 

Bybee, J.L. 1985. Morphology; A study of the relation between meaning 
and form . Amsterdam: J. Benjamins Publishing Co. 

. 1986. On the nature of grammatical categories: A diachronic 

perspective. ESCOL'85 . pp. 17-34. 



ERIC 



128 



- 122 - 



Campbell, A. 1959. Old English flr»Tim»r . Oxford: Oxford University 
Press. 

Esper, K.A. 1973. Analogy and association in linguistics and 
psychology . Athens: University of Georgia Press. 

Janda, R.D. and B.D. Joseph. 1986. One rule or many? Sanskrit 

reduplication as fragmented affixation. ESCOL'85 . pp. 103-119. 

Kiparsky, p. 1968. Linguistic universals and linguistic change, in 

Bach, E. and Haras, R.T. (eds.) Universals in linguistic theory . 
New York: Holt, Rinehart & Winston. 

Lass, R. and J.M. Anderson. 1975. Old English Phonology . Cambridge: 
Cambridge University Press. 

Levin, S.R. 1964. A reclassification of the Old Bnglish strong verbs. 
Lang . 40:156-161. 

Moder, C.L. 1986. Productivity and frequency in morphological 
classes. BSCOL'85 . pp. 176-185. 

Prokosch, B. 1939. A comparative^ Germanic grammar . Philadelphia: LSA. 

Stratmann, F.H. & H. Bradley. 1891. A Middle English dictionary . 
Oxford: Oxford University Press. 



ERIC 




OSU WPL 34 (1986) 123-126 



The Etymology of bum: Mere Child's Play* 

Mary B. Clark 
The Clintonville Academy & Otterbein College 
and 

Brian D. Joseph 

The etymology of the word bum in the meaning 'buttocks, bottom' is 
generally considered to be uncertain (e.g. by OED: s.v., by Onions et al. 
(1966: s.v.), and others). 1 One of the leading possibilities, though, is 
that £um is somehow a contraction of bottom (so Century Dictionary & 
Cyclopedia (1906. s.v.), Partridge (1966: s.v.), Morris and Morris (1977: 
s.v.)). This etymology has been denied, however, by the OED for two reasons: 
phonetic difficulties in the development from bottom to bum and the 
"historical fact that 'bottom' in this sense is found only from the 18th 
cent." (p. 1173), while bum, in its Middle English form bom, occurs as early 
as the 14th century (in Ttcvisa Higden Rolls , from 1387)7^ 

There is little to say about the attestation question; we note only that 
attestation is often merely a matter of chance and it is quite possible that 
bottom referring to 'buttocks' might have been omitted from Middle English 
texts now available for reasons other than its nonoccurrence in the usage of 
time. 3 Also, the shift in meaning from 'bottom (in general)' to 'bottom part 
of a seated person' seems natural enough to have been able to have occurred 
independently at several times in the history of English. 4 

The phonetic "difficulties", however, can be dealt with, and it is that 
side of the etymological connection of bottom with fcum that we wish to address 
here. It seems that the main objection to the phonetic derivation of bum from 
bottom stems from the fact that such a reduction or contraction was not a 
regularly occurring process in the historical phonology of English. In 
particular, bottom has continued into Modern English alongside bum , giving the 
current doublet, and there are words with a similar phonetic shape which have 
not undergone this "reduction", such as bottle and bodice , both attested early 
enough in English to be relevant to the matter at hand. 5 Thus if hum is 
derived in some way from bottom , it would have to have arisen in a dialect 
other than the one(s) providing the main input into standard Modern English. 

A solution to these difficulties was suggested to us through observation 
of the usage of our older son David. At the age of 2 years 4 months (in late 
1982), we heard David, while being diapered, say [ba(?)am], with a clear 
reference to the part of his anatomy we were most concerned with, i.e. his 
bottom. The glottal stop was somewhat weakly articulated, so that it was 
almost absent to our ears, and in fact later repetitions of the word may not 
have even contained it (hence the parentheses in our transcription). The 
resulting utterance sounded remarkably like adult bum . Moreover, the process 
responsible for the reduction evident in David's pronunciation of bottom seems 
to have been a regular one in his speech at that time. 6 During approximately 
the same period in his development, we heard [bawPl] for bo ttle , [II 31] for 
little , and [pgtmbeyar] for Paddington Bear (with assimilation of n to m), all 
with a medial dental stop? "reduced" and the resulting word "contracted", with 
some alteration of the vowels, when compared with the adult version. The 
regularity of this process in David's speech is shown also by the fact that at 
a later stage of development, 2 years 10 months, all of the above words which 
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had appeared in "reduced" form six months earlier cane to have a medial dental 
stop in them. 

Thus it is evident that there are "dialects"— or, more accurately, 
sociolects— in which the phonetic reduction of bottom to something like bum, is 
perfectly regular. We propose, then, that adult English bum has its origins 
in child speech, especially in the relatively early stages of its 
acquisition. The occurrence of bum in adult English would then result from a 
form of dialect borrowing, fostered by the opportunity for close and frequent 
adult-child interaction that diapering provides. The parent would thus be 
using and incorporating into his/her own speech a true child language form. 

As with any borrowing— dialect or otherwise — or neologism, the entry of 
bum from child language into adult speech needs only to have occurred once, 
though the possibility of recurring borrowing at several points in the history 
of English cannot be discounted. 8 Once a part of the mainstream dialect, the 
retention and spread of this word becomes a matter not of dialect borrowing 
but instead of the regular lineal transmission of langauge through subsequent 
generations. However, one interesting aspect to this proposed 
borrowing— whether it occurred once or many times — is that it is entirely in 
keeping with the Keogrammarian view of sound change, in which dialect 
borrowing can be an explanation for apparently irregular sound changes in a 
given speech community. Here the donor dialect— child language— had the 
phonetic reduction regularly and the borrowing into adult language led to the 
bum/bottom doublet and the seemingly irregular and sporadic sound change 
linking the two. 

Moreover, parallels can be found for the type of development suggested 
here for bum. The word tummy , for instance, is universally accepted (e.g. by 
QED, the American Heritage Dictionary , Partridge, etc.) as being in origin a 
nursery form or infantile alteration of stomach ; its use by. adults is 
especially common when they are talking to children, but it has penetrated 
into adult speech sufficiently to form the basis for a product name ( Turns ) and 
advertising slogan ( Turns for the tummy ). Similarly, bye-bye , as observed by 
Dilkes 1983, is another nursery word — here probably an adult conventionalized 
child form rather than a form originating with children — which has made its 
way into adult use. 



*tfe would like to thank Zheng-Sheng Zhang of the Department of 
Linguistics of the Ohio State University for his help with some of the 
research on this paper. 

1. Some sources venture no opinion at all; Webster's 3rd, for example, 
has no comment on the etymology of this word. 

2. The actual citation with the one attestation runs thus: 



Notes 



He hadde many ^ere?e evel p'at hatte ficus, ^at is a 
schrewed evel, for it seme* ^at his bom is oute >at 
ha^^at evel. [6.357] 
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This is apparently the only Middle English occurrence of bom, for both the MED 
and the OBD list only this lone example. 

3. It is possible that the QED was somewhat precipitate in its 
pronouncement that bottom meaning 'buttocks' dates only from the 18th 
century. We suspect that in A Midsummer Night's Dream (1595), the name of 
character Bottom may be part of an elaborate Shakespearean pun which plays 
upon a weaning of •buttocks' for common noun bottom . In the first place, 
Bottom is given the head of an ass (donkey); with the contemporary (American) 
senses of Bottom and Ass, i.e. bottom = •buttocks' and ass = , buttocks; 
donkey*, there is a reasonably good pun playing on Bottom's name and his 
fate. The double sense of a§s, though, is the result of a sound change 
merging ass Monkey' with the MB and BNE arse 'buttocks' through the loss of 
preconsonental r (with some vowel changes as well, most likely). However, 
there is a strong possibility that arse had an r-less pronunciation as early 
as Shakespeare's time. Barber (1976: 319) points out that "/r/ was lost in 
some non-standard forms of speech in 1MB, especially in eastern dialects and 
in substandard London speech ... before /s/ and A"/". For example, Barber 
continues, "in Troilus and Cresa ia. Shakespeare uses the word tercell 'a male 
hawk' [while] in Romeo and Juliet , the same word occurs in the form tassel 1 ". 
Both r-ful and r-less forms of arse are likely to have coexisted in 
Shakespeare's time, increasing the possibility of a play on arse/ass . Now, 
according to Rowse (1978: 1.232), Bottom's name is already to be understood as 
a pun on his occupation as a weaver, bottom referring to "a skein on Which the 
weaver's thread is wound" (and we note a] so that weavers of necessity spend a 
lot of time sitting); it therefore is not unreasonable to suggest that 
Shakespeare, as an ardent and often ribald punner, may have intended a double 
pun, pivoting on the relationship between Bottom's name and his identity with 
ass (= 'donkey' and 'buttocks'). Diagrammatically, then, the relations in 
these puns are: 



The success of these puns depends on a meaning of 'buttocks' for both bottom 
and ass (through the near—homonym in the nonstandard pronunciation of arse). 
Thus we conclude that the meaning 'buttocks' for bottom was available somewhat 
earlier than the OBD suggests, at least as early as the late 16th century. 

4. David Stampe of the Department of Linguistics of University of 
Hawaii has informed us that he has found virtually the same semantic 
connection in the Munda languages of India that he has worked with, adding 
plausibility to our claim that it is indeed a natural connection. We note 
also that among the meanings given for Middle English botme in the MED are 
several from which a shift to the meaning 'buttocks* would be fairly 
straightforward; especially relevant are the meanings *the part of a bodily 
organ farthest from the exterior; bottom of the stomach* (p. 1077). 

5. The putative "reduction" involves elimination of the medial 
consonant as well as alteration of the vowels. The source we propose below 
addresses both of these matters. 



weaver's skein < > 



Bottom < > ass ('donkey') 



(bum) < — >*[a:s] ('buttocks'). 
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6. We know of several other mothers who have also noted a similar 
pronunciation of bottom from their children, making it likely that such a 
reduction is characteristic of children's speech in general. 

7. We hesitate to label this stop, for it is unclear to us whether the 
input to David's speech included a /t/, /d/, or flap /D/ in these words. 

8. This is especially true if the reduced form is a common child 
language pronunciation; see footnote 6. 
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Snail Group Lexical Innovation: Some Examples 



Christopher Kupec 



The slang of small groups is usually ignored or dismissed by writers of 
popular etymology books and slang dictionaries. Scientific linguistic 
journals as well do not often deal with this subject. 1 Yet it is a valid part 
of the linguistic competence of the individuals using this slang, and as such 
deserves some attention by linguists. For one thing, individuals, and not 
speech communities, are the creators of new words. And individuals create them 
within the setting of their own social groups. Moreover, if we study this 
locale of lexical innovation in more detail, we can discover a number of 
unusual processes of word-formation. We can also gain some insights into the 
social side of language change, and in particular, into the social mechanisms 
involved in the creation, acceptance, and usage of innovative words. 

Some examples of small group lexical innovation are provided in this 
paper in order to undermine the notion that slang uses only typical processes 
of word- format ion (e.g., compounding, prefixation, suffixation, semantic 
radiation, concatenation, etc.), and to investigate the social side of 
word-making. 

The following examples were coined and are used by a circle of ten or so 
c}ose friends. All members of this group are in their early twenties, 
although some of the words they've created originated during their high-school 
years. Where possible, the various creators are identified (by initials), in 
order to make it easier to see who creates most of the words. 

1. Orthographic Origin 

bicth fblk6] n. 'a person* Used jocularly and derogatively. Created when M. 
attempted to write bitch backwards on the inside of a steamed-up car 
window, so it could be read from the outside. Note that the change 
affected the meaning melioratively. The (kg] cluster is nonexistent in 
standard English (Cohen 1952: p. 74). 

said-ass case [std >s keis] n. *a confused, pathetic individual', from a 

spelling error in a letter that M. wrote to another member of the group. 

music for monkey n. 'music' A spelling error of a songtitle, 'music for money', 
written by M. on a cassette case. 

nelk neek (n£lk nik] interj. A nonsense, identification word. 2 Originally 
neek-neek. an interjection used by Jack Nicholson's character in Easy 



Rider after taking a swig of whisky. M. had the word ironed onto a 
T-shirt in captal letters, NKEK NEEK , and after several washings the top 
two horizontal bars of second E wore off, leaving NELK NEEK . Created by 



nice teds n. 'nice breasts' A misspelling of ni ce tits by t. Palestinian friend 
of the group, written on a Valentine's Day card for little children. It's 
possibily his phonemic representation of a word he'd never seen in print. 
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2. Phonetic Fortition 

jgoJi joojeh [dzud£j» (dzu-dz/v] n. *a person; a guy' Sometimes used as a 
nonsense, identification word. D.'s emphatic pronunciation of dude , an 
unpopular word in this group. Jooj or its variant form j oojeh , however, 
is acceptable. Possibly this is because it isn't dated or common, and 
because it disparages the people who do actually say dude . The phonetic 
changes appear to be fortitive, since [d] is turned into an affricate 
[d£], which is very expressive (as evidenced in Jesus , jeez , jerk , and 
jism ). and extremely forceful. The variant joojeh seems to have arisen 
from a reinterpretation of the vocalization after the release of the 
second affricate as a full vowel. Note that joojeh has its stress on the 
second syllable, underlining the fact that it is a fortition. 

looz ar (lu*zar] n. 'an imbecile; a moron' M. 's pronunciation of loser . The 
r-colored schwa in loser has been reinterpreted as [ar] when both 
syllables are given equal stress. From this, it would seem likely that 
/ar/ is M. 's phonemic representation of the agentive -er ending. A 
variant exists in a front-clipped form, zar . 3 

hroach ; heroach [hrouc]; [h roue] interj. 'I got burned' or 'You got burned 
(by the speaker or someone else)' M. and G.'s pronunciations of roach , 
'burnt' or 'ruined'. The origin of the initial [h] is uncertain^ In 
the variant heroach . the transition from the word-initial [h] to [r] has 
been reanalyzed as a vowel, possibily because of the trouble in 
pronouncing [hr] as a word-initial consonant cluster in standard English.' 

skrc h [skrc] interj. 'scratch; ouch' Created by V. Said by him to sound 
more onomatopoetic than scratch . ^ 

3. Front Clipping 

splud fspl*d] n. 'a kidney punch* & interj. 'take that!' Created by M. by 
clipping 'Piss blood!' It's apparently unrelated to the British 
English curse, ' sblood for 'Christ's blood*. Notice how the removal 
of the verbal element brought on the treatment of splud as a noun. 

medic PmK'dlk] interrogative, 'wan ay dick?' Created by K. for use in public 
by clipping 'Want my dick?'. Along with the pronunciation, the stress 
changed as well to coincide with medic meaning 'corpsman'. 

zar [zar] n. 'a jerk* See loozar above. 

4. External Source 

hojoba phou'dzVb*] interj. An idencification word. It comes from jojoba , an 
ingredient in certain shampoos. The pronunciation however is neither 
straight from the Spanish word jojoba nor a direct Anglicization 
((ho'ho'ba] and [•dzowdza'bo], respectively). Rather it is a merger of 
the two. Originated with K. and N.. 

neek neek [nik nik] interj. An identification word. Originated with M. See 
ne Ik neek above. 
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5. Word Manufacture Ex Nihilo 

weep wa, weep wawl fwip«wa]; fwip wa*] interj. An identifacation word. 
Believed to have been created by N. 's brother-in-law. 

woob fwub] vt. 'to deceive, toy with', vi. Ho wobble, waver', & interj. M. 
originated it, but he doesn't remember how. The verb is completely 
regular. The intransitive verb's meaning seems to have been influenced 
by the similarly spelled word wobble . WOOB was what M. had ironed onto 
the other side of his NEBK NBEK T-shirt. 

schweibik fswei'blk] interj. An identification word. Created by N. (along) 
with schwebel, scheben, and schlabonowitz ) while daydreaming in his 
German class. He wanted German-sounding nonsense words, although he 
wasn't sure if they meant anything in German. But it seems that they're 
nonsense words in German as well. 7 The four words are often used in 
a catch-all phrase, ' Schwebel, schweibik , schlebe n, schlabonowitz '. which 
is spoken as if it were a verb conjugation. 

schwebel. (swei'bl) interj. An identification word. See schwei bik above. 

schleben fslei'bn] interj. An identification word. See schweibik above. 

schlabonowitz Isla 'Wnj'wlc) interj. An identification word. See schweib ik 
above. 

6. Malapropism 

dispo se v. 'to suppose; to guess* A play on suppo se, created by M. 
reconcile v. 'to guess, to reckon' Created by N. from reckon . 8 

7. Semantic shift 

Jack jizz pn. A nickname for M. 's father. Originally Jack jizz meant M. 
(the reasoning being that in common slang, jizz means 'semen', so 
Jack, Jizz meant Mack semen', i.e.., M. whose father's name is Jack). 
But through time, the meaning shifted back to 'Jack', so the second half 
of the nickname is just added baggage. 9 The reasons why this happened 
are unclear. Jizz was tried as a regular patronymic in this group, but 
only Jack jizz has survived, however, so perhaps it is because of its 
alliteration, and its being influenced by expressions like Jackshit and 
Jackdick. 

Nelson pn. A nickname for K. Since another of K. *s nicknames is The legen d, 
this nickname is a hidden pun. It is dependent on one's knowing the name 
of a park in northeastern Ohio called "Nelson's Ledges" or 'Nelson 
Ledges', which sounds similar to Nelson Legend .-- 

Wisner Road Resident n. 'a melonhead* Specifically it is a nickname for B. 
Where this group comes from, there is a local legend about melonheads : 
inbred, insane albinos with huge heads who come out at night to wreak 
havoc on anyone foolish enough to drive down Wisner Rd., where 
supposedly these creatures live. Calling someone a Wisner Road Resident 
is the same as .calling him a melonhead . 
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The Nightmare pn. A nickname for M. 's old Plymouth. When Lee Iacocca started 
going on TV to advertise his company's cars, there was a joked-about 
phrase, 'a Lee Iacocca dream-car', for any car made by Chrysler. M. owned 
an old Plymouth Fury that was a piece of junk, so instead of calling it 
'a Lee Iacocca dream-car', he called it a Lee Iacocca nightmare , or for 
short, The Nightma re 

8. Metathesis 

kirsheds fkjTs'c] n. 'crushed red pepper' Created by K. Actually this was a 
speech error, but K. insists trat he meant to say it, and this could be 
true, since crushed s has never t'jen used by this group. If crusheds were 
pronunced [!tr/s$c], it would seem easier to say kirsheds (k/sc], since 
(kr sc] is a more complicated artKulatory sequence than (krs'c] (i.e., 
back to front to back to front vs. Lack to front). 

The processes of word -formation illusv r ated here emphasize the variety 
of methods available and the prevalence of unusual paths in lexical 
innovation. Orthographic origin, fortition, ana ^alapropism are given little 
or no status in most published works on word- formation. Sentential 
front-clipping is not a common process either. As for metathesis, semantic 
shift, external origin, and word manufacture ex nihilo : how often do we get to 
see how a word comes about, why it comes about, and who is responsible for it? 

The examples shown also let us look in on the social aspect of lexical 
innovation. The prevalence of identification words (words that have no 
defined meaning but are used to strengthen group identity and cohesiveness) 
points out how important the social setting that these are created in is to 
the members of the group. Using such words, in a sense, affirms (both for the 
speaker and for the rest of the group) that the speaker is part of the group. 
This is the same reason that much technical jargon is used in various 
circles. Everybody wants to belong. 

The h.unor of these words is also worth mentioning. New words seem to be 
accepted by the group based on their humorous content, as well on their 
productivity. If a word's not funny, it's not used. An^ once it loses its 
freshness, the word is dropped from use more often than not. This is because 
word- format ion and usage are active, self-conscious processes. Quite often 
members of the group will comment on their irritation over a word: either 
that they use it too much or that somebody else does. This habit extends 
beyond the group. Parties and get-togethers are excellent locations to attack 
dated slang and "sloppy speech" used by strangers. If a word isn't liked, the 
attacker uses it on the person continually until he or she realizes what the 
problem word is. Sometimes this method will backfire and the word is instead 
considered to be well received. A number of words have been created that way 
in this group: kirsheds . skrch, and loozar for example. The group is 
constantly on the lookout for verbal deviance in themselves and in others 
outside of the group. 

That all these mechanisms and methods need to be examined in greater 
detail is obvious. I'm not the first to notice a lack of literature and 
research on these topics. But, hopefully, this paper gives others some ideas 
and concrete examples concerning lexical innovation and its social setting. 
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Notes 



1. Maledicta is a major exception to this trend. 

2. Identification words are usually nonsense words that are used to 
fortify relationships within a group. They delinate who's who. Nonsense 
baby-talk used between lovers can also be included in this term, since it 
strengthens the bond between then and it affirms the loving nature of the 
relationship. Essentially, identification words identify the group that the 
speaker belongs to. 

3. Zar works well in public, because the unknowing recipient of the 
insult can think he's being admired, cf. czar . 

4. The #h- might have arisen from the heavy aspiration given to the [f] . 
The tongue is extremely retroflexed and often the pharyngeal fricative [h] is 
used or the uvular fricative [)0 instead of [h] . 

5. It might be worth noting that all the aspirated-r words that English 
took from Greek are unaaprated in English. But since the [h] of h roach was 
desired, a vowel insertion was nade, so that the [h] wouldn't be lost. 

6. As V. put it, 'Bees go "bzzz" not "buzz"' (from a conversation). 

7. It is possible, however, that schweibik could be Schwab ig 'Swabian' 
(Swabia being a region in southwestern West Germany) in German. Schwebel is a 
surname, as in Schwebel 's , which is a bread-baking company in the U.S. 
Schleben could h*ve been influenced by secondary reduplication of the type, 
actor schmactor. i.e., leben schleben (since in German, Jeben means *to 
live'). Schlabonowitz could be a surname also. All these words are probably 
influenced by the expressive, derogatory nature of sC- in Yiddish slang (e.g., 
shtup . schlang . and schlock). 

8. Usually only used in the sentence, «I reconcile...'. The same usage 
is used with dispose . 

9. This type of semantic change is not infrequent. For example, rhubarb 
is derived from the Late Latin rha barbarum 'foreign rhubarb'. But when only 
one species was present, the 'foreign rhubarb', the classifying modifier lost 
its meaning. So maybe in the case of Jack jizz . the high number of nicknames 
for M. alcug with the lack of nicknames for his father influenced the change 
in meaning. 

10. An example of the impression that the inexact pun is funnier than an 
exact one. 
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WORD FREQUENCY AND DIALECT BORROWING 



Debra A. Stollenwerk 



1* Introduction 

That word frequency plays a significant role in the spread of language 
change was suggested as early as a century ago by Schuchardt. (Phillips 1984: 
321) In this century George Zipf (1929) has proposed relative frequency as a 
determinant of sound (phonetic) change. This frequency hypothesis claims that 
phonetic change (i.e. physiologically motivated change such as assimilations, 
vowel- re duct ions and the like) operates on high- frequency items first; all 
other change (characterized as analogical or conceptual — i.e. not 
physiologically motivated) affects low-frequency forms first (Phillips 
1984:336-337). 

That frequency is a significant factor in the spread of language change 
is supported by evidence from numerous studies. Leslau (1969) presents 
evidence in Ethiopian languages that certain phonetic changes such as 
sp i rant izat ion, elision and assimilation affect high-frequency words first. 
Fidelholtz (1975) gives evidence that vowel-reduction (a phonetic change) in 
initial syllables correlates positively with frequency. For example, 
astronomy, mistake and mosquito, classified as relatively frequent forms, may 
occur with a reduced vowel in the first syllable; less frequent words such as 
gastronomy, mistook and Muskegon are less likely to occur with the reduced 
vowel— although it is noted that residents of Muskegon are more likely to 
reduce the vowel in that form presumably because it is a more frequent form 
for them. One might expect similar behavior for the item Sandusky (Fidelholtz 



Hooper's (1976) analysis of schwa-deletion in English (yet another 
phonetic change) yielded an identical conclusion. Consider the following word 
pairs: nursery /cursory ; scenery/chicanery ; celery /artillery ; memory /armory . 
In each pair it is the more frequent form (i.e. the first form) which is more 
likely to undergo schwa-deletion. 

The evidence also suggests that it is the low- frequency forms which are 
the first to undergo non-phonetically motivated change (i.e. conceptually 
motivated or analogical change). Tt is this type of change which is the 
subject of Toon's (1978) analysis of h-deletion in Old English morpheme 
initial consonant clusters written as hn, hr, hi and hw. Here, he found that 
the low- frequency words exhibited the greatest rate of deletion. In his 
discussion of lexical diffusion in early Old English, Toon presents a model of 
sound change in which an innovation operates initially on low-frequency items, 
gradually spreads to and accelerates through high-frequency forms to 
near-completion and in the final stage leaves a small residue behind after 
reaching completion— e.g. the merger of /eo/ and /io/ and the raising of West 
Germanic *a to /o/ before nasals. That sound changes may operate initially on 
low- frequency words suggests an explanation for "either the initial state of 
low-level variable application or a completed end state with residue" both of 
which are regularly observed in sound change. (Toon 1978:362) 



1975:200). 
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Additional evidence of analogical change affecting low-frequency forms 
first comes from Hooper's study of the six verbs creep , leap , weep , leave . 
sleep and keep (Hooper 1976). Bach of these verbs has a preterite form 
occurring with a lowered, laxed vowel — e.g. crept . kept , leapt . Only the first 
three, however, can optionally take a regularized form in the preterite — i.e. 
creeped , leaped and weePed . Analysis of frequency reveals that the mean for 
the forms subject to leveling is 37 while the mean for those not subject to 
leveling is 485. Thus, the leveling (analogical change) operates on the less 
frequent words. 

While there seems to be a good deal of evidence in support of a 
frequency-effect on the spread of sound change there may also be evidence that 
frequency is a factor in dialect borrowing. What I propose to do in this 
paper is to examine the possible role of word frequency in dialect borrowing 
by presenting some evidence from my own dialect in which words that are, for 
the most part, derived historically from Middle or Early Modern English o vary 
in pronunciation. 

2. Variation in 'o** -words 

There occur in my speech different phonetic realizations of graphic o 
before /g/ such that graphic <o> is realized as either /a/ or/ /. (It should 
be noted here that /a/ a.^d /I/ are contrast ive in my speech, serving to 
distinguish, for example, cgt and caught or tot and taught.) Based on my 
auditory perceptions the distribution of these sounds before /g/ is as follows: 



PI 


M 


frog 


cog 


log 


clog 


fog 


jog 


hog 


♦smog 


dog 


bog 


♦smog 





Within this corpus of data all forms except snog and jog are derived 
.from either Middle English or Early Modern English o— thus, 

frog < ME frogge 

log < ME logge 

fog < ME fogge 

hog < ME hogge 

dog < MR dogge 

clog < ME clogge 

cog < ME cogge 

bog < OE bugan, Early Modern bogge 

(from the Oxford English Dictionary . 1933). Because of the common phonetic 
source of the vowels of these words (i.e. Early Modern or ME 6) it might be 
expected that the graphic <o> would be homophonous. The items fog and snog 
might be expected to be homophonous as well since snog , a relatively recent 
word, (the first nttestntion being 1905) is derived from fog (via a blending 
with s moke ). The word Jog appears to be relevant to the data base as well 
although its etymology is uncertain (possibly an alteration of ME shog or 
derived from Early Modern iogge (Oxford English Dictionary 1933). 1 
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smog is found with variable pronunciation, perhaps a reflection of the 
confusion I am currently experiencing over the pronunciation of this word. 
(In fact, in a recent lecture, I said the word twice. The vowel of the first 
utterance I perceived as rounded while that of the second as unrounded. 
Moreover, the second utterance was perceived by me as a correction of the 
first.) 

Spectrograph* c analysis of the syllable nuclei in these forms supported 
the intuitive distribution (see above) with the exception of smog which 
exhibited some rounding. The words, on the basis of spectrographs evidence, 
have been arranged into thtee groups: (A) consists of words in which the 
phonetic realization was /0/\ (B) consists of forms in which the phonetic 
realization was /a/; (C) consists of smog in which the intended target (i.e. 
the target at which I perceived myself to be aiming) was /a/ but which 
spectrographically exhibits some rounding. The values for Fl and F2 for each 
item are listed, as well as the mean (X) and standard deviation (SD) for each 
group. 

Table 1 





Item 


Fl 


X / SD 


F2 


X / SD 


(A) 


frog 


769.3 




1153.8 




P/ 


log 


846.2 


X=800 


1153.8 


X=1199.9 




fog 


769.3 


SD=42.1 


1153.8 


SD= 68.78 




hog 


846.2 




1230.7 






dog 


769.3 




1307.6 




(B) 


cog 


923.1 




1653.9 




/a/ 


bog 


1000.0 


X= 897.46 


1615.2 


X= 1653. 83 




jog 


769.3 


SD= 95.91 


1692.4 


SD«- 31.52 


(O 


smog 


846.2 


X=846.2 


1461.4 


X=1461.4 



Lip-rounding and labiality have the effect of lowering formants. 
Clearly, then, the items in (A) exhibit a more rounded vowel than those 
in (B) as evidenced by the higher F2 values of the forms in (A). In 
smog, where the intended target (intuitively speaking) was /a/, the vowel 
is immediately preceded by a labial consonant. F2 for these items is 
noticeably lower than the values for F2 in (B). 

So, group (A) exhibits roundedness with a mean of 1199.9 for F2; 
(B) clearly exhibits less rounding with a mean F2 of 1638.47 and (C), 
where intent and realization diverge, is intermediate between (A) and (B) 
with a mean F2 of 1358.86. 

In (B), the item bog also contains a Znbial consonant immediately 
preceding the vowel, yet there is no significant lowering of formants. 
Because of its stop-quality, however, /b/ is more unlike a vowel while 
/id/, being a sonorant, is more vowel-like. Given the intended target of 
/a/ for (C) as well as the intermediate F2 value, it would appear then 
that co-articulatory rounding is a factor in the realization of the vowel 
in (C). 
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In order to analyze overlapping I plotted on a graph the X-value for 
each group as well as two standard deviations above and below that point. 
The resulting graph showed that the phonetic realization of smog more closely 
approximates the rounded vowel than the unrounded vowel, probably a result of 
co-articulatory rounding. 

3. Frequency effect 

Referring again to the word groups of Table 1 (shown below for 
convenience) and temporarily ignoring group C, a pattern may be observed in 
terms of the frequencies of items within groups A and B. 

(A) (B) 

frog cog 

log bog 

fog jog 

hog clog 
dog 



That is, the data suggest that the items of (A) — realized with the 
rounded vowel— are of higher frequency than those of group (B)— realized 
with the unrounded vowel. (The item in (C) is intermediate between (A) 
and (B) in terms of roundedness and appears to be undergoing 
co-articulatory rounding as a result of a preceding labial sonorant.) 

In order to examine the issue of frequency, two sources giving a 
frequency analysis of English were consulted. The Kucera and Francis 
volume (1967) ranks items by means of a three-figure number— e.g. 
1-01-001 — where the first figure designates frequency of occurrence in 
thi corpus; the second figure indicates the number of genre subdivisions (out 
of a total of 15 genre subdivisions) in which the word occurs; and the third 
figure indicates the number of samples (out of a total of 500 samples) in 
which the item is found. What follows then is a ranked listing of the data in 
order of least frequency to greatest frequency: 

bog 1-01-001 
cog 1-01-001 
frog 1-01-001 
jog 1-01-001 
smog 1-01-001 
clog 2-02-002 
hog 3-02-003 
log 11-05-007 
fog 25-09-018 
dog 75-12-028 

The listing shows a clear delineation of low-frequency/high-frequency between 
hog and log and indicates as well that hog and frog (counter to my intuition) 
are of relatively low frequency. These frequency counts, however, are based on 
written usage rather than spoken usage. And, as Hooper (1976: 98) notes, 
frequent forms in written text are found to. occur even more frequently in 
spoken usage while less frequent forms in written text occur even less 
frequently in spoken usage. 
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The American Heritage Word Freq uency Book ranked 86,741 different words 
out of n total corpus of 5,088,721 tokens. What follows, again, is a ranked 
listing from least to greatest frequency where F represents total occurrence 
of the form in the overall corpus and U represents the estimated frequency per 
million tokens: 





F 


U 


groggy 


2 


.1142 


cog 


2 


.1152 


clog 


2 


.1257 


jog 


3 


.2477 


bog 


14 


1.7289 


smog 


22 


2.4598 


hog 


33 


5.0096 


frog 


171 


26.143 


fog 


212 


33.553 


dog 


1380 


231.49 



The U-figure of the righthand column shows a fairly sharp increase 
between smog and hog (slightly more than a 100* jump). If the line between 
low and high frequency is drawn here, there is accord between intuitive 
judgments and interpretation of frequency data. In any event, the item frog 
is of notably higher frequency in this analysis than in that of Kucera and 
Francis. 

Analyzing, then, the distribution of /a/ and /p/ in these forms in terms 
of frequency (the lexical item hog being the only relatively infrequent form 
in group (A)), there may be reason to assume that the variation in 
pronunciation among the relevant forms is linked to a frequency effect; that 
is, the frequent forms (with hog being interpreted ns fairly frequent) exhibit 
the rounded vowel while the relatively infrequent forms exhibit the unrounded 
vowel. 

4* Dialect borrowing 

The frequency effect which I am proposing here differs somewhat from the 
frequency effect of the aforementioned studies by Hooper, Phillips, etc. That 
is, the focus of those particular studies was the role of frequency in 
language change (be it phonetic or conceptual change) whereas the focus of 
this paper is the role of frequency in dialect borrowing. It is possible that 
the variation under analysis in this study represents language change in its 
initial stages. However, this type of change seems to be neither phonetically 
motivated nor analogical. Possibly this variation does not so much represent 
sound change motivated by internal factors as dialect borrowing motivated by 
external influence. The latter is clearly a possibility given the influence 
which various dialect areas have had on my speech— namely, Toledo, Ohio (birth 
to age seven); Jacksonville, Florida (age seven to ten years); Portland, 
Michigan (age ten to sixteen years); and Columbus, Ohio (age sixteen to the 
present). 

Marckwardt, in his study of Middle Knglish o in the Great Lakes region 
(1940:570), identifies the unrounded vowel as primarily a Northern feature 
(i.e. in my case, a feature of Michigan speech) and reports the following 
distribution for the stressed vowel in fog, foggy, frog and hog: /a/ prevails 
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throughout Michigan and /?/ is found throughout Ohio. For the items log and 
dpjj /V * s favored throughout the entire Great Lakes region. (1940: 562) 
Clearly, then, in view of the above distrioution, the /:>/ quality of the 
stressed vowel in fog, foggy * frog and hog represent an Ohio (or non-Northern) 
pronunciation. 

While the distribution of /y/ and /a/ in Michigan and Ohio is divergent 
for some forms, the status of ME o in many groups of lexical items in the 
Toledo area is identical to that of Michigan. For example, in the items 
hospital , vomit , college , cottage , stopper , pocket , God and hod the stressed 
vowel in each is unrounded in both northern Ohio and Michigan while being 
rounded in Central Ohio (Marckwardt 1940:566-569). However, this area of 
Northwestern Ohio has been analyzed as a transition area by Davis and 
McOavid — that is, an area which has experienced (or is experiencing) influence 
from two or more "directions so that competing forms exist in it side by side 
(1950:264). They found, in their five-county survey, that the unrounded vowel 
was centered largely at Perrysburg (a suburb of Toledo). However, even here 
variable pronunciation of the vowel in fog, foggy * on, oa and grandpa occurred 
(19:270). 

Distribution of these vowels was not available for clog(ged) , smog , 
bog(ged) , jog and cog . Therefore, in order to determine the pronunciation of 
the syllabics in these forms for the Toledo area (which in my speech occur 
with the unrounded variant) I selected two natives of Lucas county who read a 
set of sentences containing these forms (each sentence being read twice) from 
which spectrograms were made. (One informant was, in fact, a native of 
Perrysburg.) Spectrographs analysis yielded the following distribution: 

Table 2 



Distribution of /a/ and /V in Toledo 





Inf. 1 




Inf. 2 




smog 


/-)/ 


/?/ 


/V 


/>/ 


bog(ged) 


/>/ 


M 


/>/ 


/J/ 


jog 


/a/ 


/a/ 


/a/ 


/a/ 


cog 


/->/ 


/V 


/a/ 


/a/ 


clogged 


/V 


h/ 


/?/ 


/>/ 



Note that the only item which was consistently unrounded in the speech 
of both informants is Jog and that one informant pronounced cog with the 
rounded vowel while the other produced this form with the unrounded vowel. 
These results are consistent with the findings of Davis and McDavid in terms 
of the existence of competing forms in the area. 

Thus the status of graphic <o> before /g/ based on my own findings as 
well as those of Marckwardt, Davis and McDavid can be condensed into the 
following tabled 
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Table 3 



Michigan 


Toledo 


Self 


log 




Z> 


O 


dog 




J 


J> 


fog 


a 


a/J> 


o 


foggy 


a 


a/;> 


J> 


. frog- 


a 


0 


o 


hog 


a 


0 




smog 


a 




a/O 


bog(ged) 


a 




a 


jog 


a 


a 


a 


cog 


a 


a/P 


a 


clog(ged) 


a 


:> 


a 



The table is divided into three groups of items- -i.e. 1) high-frequency; 2) 
intermediate-frequency; and 3) low-frequency. The results indicate that /a/ 
prevails in Michigan regardless of frequency and predominates in my speech in 
only the low- frequency forms whereas / / prevails in the speech of the Toledo 
informants. Note also the clear divergence of my speech from the Toledo 
pattern in the words bog and clog where my speech exhibits the unrounded 
(Northern) vowel and that of the Toledo informants invariably exhibits the 
rounded vowel. What appears to be happening here is outside dialectal 
influence — or dialect borrowing. 

Labov (1972) discusses the dialect borrowing/restructuring which may 
occur when a speaker with an already-formed linguistic pattern moves into 
another dialect area. He identifies the formative period of first language 
acquisition as four to thirteen years of age and in his study of New York 
City* 3 vowel system in the speech of informants from the Lower Bast Side found 
ten years to be the critical, cut-off age for native speakers moving into New 
York from other U.S. dialect areas (1972;305). Thus, one would expect that a 
speaker who moves into New York before age ten is more likely to adopt the 
vowel pattern of New York than a speaker who moves into New York after age 
ten, the age at which I moved to Michigan. 

Given the the evidence from Labov's studies and the predominance of 
the unrounded vowel (a Northern feature) in only the low-frequency forms in my 
speech, it seems reasonable to propose that sometime between age ten and 
sixteen I adopted the vowel pattern of Central Michigan. Clearly, however, 
this did not represent a wholesale adoption since higher* frequency forms such 
as fog and hog, which in Michigan exhibit an unrounded vowel, in my speech 
retain the rounded (Ohio or non-Northern) vowel. Thus, it would seem that 
word frequency is playing a significant role here in the phenomenon of dialect 
borrowing. 

Antilla (1972) makes the claim that not only does frequency play a role 
in language change but in pronunciation borrowing as well. "Words with high 
local frequency tend to be the last ones to be changed. .. (because] .. .high 
local frequency acts as a barrier to change from the outside. 11 (1972: 188) In 
this case, his claim would mean that high frequency words such as fog, log and 
dog, whose forms are firmly established in the memory of a speaker, would be 
resistant to influence from outside dialect patterns. Low frequency words, on 
the other hand, such as clog or bog, whose forms are not as firmly established 
in the speaker's memory, are less stable and therefore more likely to be 
affected by outside change. 
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5. Conclusion 

It is this, then, which I propose here — that the phonetic variation 
found in what might reasonably be expected to be a hoaophonous corpus (with 
respect to graphic <o>) can be explained in terms of the role which word 
frequency plays in the process of dialect borrowing. That is, the high 
frequency forms (with the unrounded vowel) appear to have resisted the 
influence of a Michigan vowel pattern whereas the less frequent forms (with 
the possible exception of hog which may actually be of higher frequency in 
spoken usage) were more susceptible to outside influence. It may well be that 
word frequency figures significantly not only in language change but in 
synchronic variation as well. 

Notes 

*J am very grateful to Keith Johnson for the hours of patient listening 
and assistance he gave me— especially with word-processing and phonetics 
dilemmas. Hopefully, the incessant interruptions are over. 

1. The corpus might reasonably be expanded to include other 'og* -words 
such as soggy , groggy and eggno g which, even though not historically derived 
from ME o, are graphically identical and say be rhymed with, for example, 
foggy * 

2. The results for items smog- clog(ged) in (Central) Michigan are 
based on my own auditory judgements and are consistent with the 
predominance of /a/ in that area for not only the forms in 1) and 2) but 
graphic-<a> words like want , watch , ma and pa and other graphic-<o> words 
such as vomit , on, cottage and pocket as well (Marckwerdt 1940). 
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Introspection into a Stable Case of Variation in Finnish* 



Riitta Valiaaa-Blua 



One of the sore coaaon signs of language change is vacillation 
between two competing form. But once the change has run its course the 
older font typically disappears frosi the language. This, however, is not 
always the case: sometime* the archaic form remains as a stable variant 
of the newer, unaarked fora due to, e.g., borrowing froa spoken and/or 
written archaic dialects. In a study of language change in apparent tiae 
these coexisting alternants would have to be taken to signal ongoing 
change but this conclusion would be false nince the vacillation in only 
seeaing. The' present study explores one instance of this kind of 
apparent vacillation not signalling change in Finnish. 

When reading a chapter on the changes in Finnish phonology 
(Hakulinen 1979) I realized that aany of the older foras are still (in 
soae sense) in ay vocabulary. According to Hakulinen soae of these still 
exist in conservative dialects (presuaably as the unaarked foras); soae 
are aarked aa archaic but aany fc**ve completely disappeared froa aodern 
Finnish. Below are listed soae of the foras aentioned in Hakulinen which 
I do find in ay stylistic repertoire, which naturally cannot be totally 
idiosyncratic but is necessarily shared: 



ARCHAIC NEW 



'without fish' 

'aoors' 

'of the prey' 

'caae' (impersonal) 

'of the aale' 

•grey' 

»I open' 

'would coae' 

'would go' 

'he goes' 

'of the pine trees' 
'of the rye' 
'may it coae' 
•hair' 

'on the eggs' 
•horse' 

'errand; matter • 
•skin* 



Tne wora pairs anove were given «i w™. — , 

assuming that a naive speaker at one particular point in tiae is not 
aware of *He diachrony of his language, we could say that the second fora 
in each ?-ir is the basic synchronic fora froa which the first alternant 
is a deviation. For soae of the 'deviations 1 we can write synchronic 
phonological rules by which these foras are derived froa the new one. 
The fora of the synchronic rule probably reflects the inverse of the 
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(1) 


kalatoin ~ 


kalaton 


(2) 


kankahat ~ 


kankaat 


(3) 


saalihin ~ 


saaliin 


(4) 


tultihin - 


tultiin 


(5) 


urohon ~ 


uroon 


(6) 


harmaja 


haraaa 


(7) 


avajan 


avaan 


(8) 


tuleisi - 


tulisi 


(9) 


aeneisi 


menisi 


(10) 


aenevi ~ 


aenee 


(11) 


honkaen ~ 


honkain 


(12) 


rukihin 


rukiin 


(13) 


tulkohon ~ 


tulkoon 


(14) 


hivus " 


hi in 


(15) 


muneilla ~ 


aunills 


(16) 


he vo in en ~ 


hevonen 


(17) 


asja 


asia 


(18) 


hipja 


hipia 
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historical relation obtaining between the item* in questions. For 
example, we could write a rule like 

/ — > h/V VC# 

to five fonss like (2-5). Or, a i-inaertion rule for form like (6) 
and (7). But for many of the words it is difficult to formulste mles 
since they appear to be isolated elements and no generalizations can be 
extracted froa theai. Thus the archaic part of jsy language contains 
archaic words per se, and generalizations by which archaic vocabulary 
item are created froa the Modern form. 

Not Vacillation but Borrowing 

Do these archaic item exist in ay speech because they are the older 
nora form which are now disappearing? According to the source 
(Hakulinen), as was ststed above, some of the form are still used in 
conservstive dialects; som are archaic, but some have completely 
disappeared. My dialect is not one of the conservative ones and none of 
the item above is part of this dialect in the usual neutral synchronic 
sense. They are never used in lieu of the unaarked form in unmarked 
discourse, not by ay generation, nor by any other generation in the same 
dialect group - there is no vacillation between two coapeting unaarked 
form involving these item. Thus it aeem that they do not exist in ay 
speech because they are the disappearing older variants of the modern 
form. What is their source then? I propose that the source is 
borrowing froa conservative dialects (with which I aa not intiaately 
faailiar) and froa literary sources like the Kalevala, and other folk 
poetry* and folk songs. 

The reason why the literary works Mentioned above are seen as a 
possible source is that every Finn has to read at least parts of the 
Kalevala at school at soae point or another; also other folk poetry, such 
as the Kant ele tar, is studied, and folk songs are sung in music education 
classes. And, what is more important psychologically, these works are 
highly valued - they are soae of the "national treasures" and as such 
they belong to the whole population. This fact makes them "common 
currency", since everybody is exposed to samples of these works, they 
would be an obvious candidate for the origin of the archaism under 
consideration. 

Naturally, it is difficult to pin down ex~.\ily the aource froa which 
a lexical itea has entered one's mental lexicon. I may have learned 
them, for example, from my schoolmates or relatives or mass media, etc., 
but the ultiaate source of the old form is proposed to be the archaic 
language of above mentioned and comparable works, and possibly 
conservative dialects. 

Dowty has proposed that speakers are "potentially capable of 
remembering thst they have heard a newly derived word for the first time, 
in a way that they very rarely recall hearing a sentence for the first 
tiae" and thus "speakers are able to distinguish between actual and 
merely possible (but well-formed) words in a way that they are not able 
to distinguish between actual and aerely possible sentences" (1978: 
120). These facts suggest to Dowty that lexically derived expressions 
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would be learned individually, unlike sentences. I do not agree with 
this for all lexical acquisition but it is cert sin ly plsusible in the 
present case. The deviant derivations are learned by persons who already 
ere fluent speakers of Finnish. At this time they are, however, learning 
something that they know is deviant, and maybe for this reason some 
features of the source get sttsched to the set of connotstions of the 
non-normal forms. 

It is known thst the writing aystem for a language can change 
language st the phonological and even the Morphological level (Polome* 
1981) but more relevant for the case on hand is that written Materiel can 
function as s source of borrowing. Bloomfield (1933) cites exanples of 
dialect borrowing of this kind which actually have led to change. 
Misunderstood archaic words «ay introduce new words or new Meanings for 
words, e.g., derring do was 'daring to do' but becaMe 'brave actions' 
(ibid., 487). But even some obsolete words May becoMe reintroduced into 
the language froa written sources: sooth and guise are examples of such 
words in English (ibid.). 

Spelling pronuncistion of English is one of the sources of variation 
and change comparable to literary borrowing: often is often pronounced 
with a [t] . Also completely new words come to language from "**iting: 
Corns omol , prof , end lab are such items. Bloomfield's examples make it 
indeed plausible that the ultimate source of these now archaic forms of 
Finnish could be the Kalevala and other similar written works. 

The first page of the Kalevala gives us some of the forms mentioned 
above (the trans 1st ion comes from an English version of the collection 



The first two lines give samples of type (10) given on the first page: 



(10) menevi ~ menee 

And lines (3) - (6) illustrete types (2) and (3): 

(2) kankshat ~ kankaat 

(3) ssalihin ~ saaliin. 

Line {J) has a form parallel to 

(11) honkaen ~ honkain 



(1963)): 



1. 

2. 
3. 
4. 
5. 
6. 
7. 
8. 

a. 

10. 



Nieleni minun tekevi 
Aivoni ajattelevi 
Lahteani laulamahan, 
Saa'ani sanelemahan, 
Sukuvirtta suoltamahan 
Lajivirtta laulamahan; 
Sanat suussani sulavat, 
Puhe'et putoelevat, 
Kielelleni kerkiavat, 
Hampahilleni hajoovat. 



It is my desire, 

it is my wish 

to set out to sing, 

to begin to recite, 

to let a song of our clan to glide on 

to sing s family lay. 

The words are Melting in My mouth, 

utterances dropping out* 

coming to my tongue, 

being scattered about My teeth. 



OLD 



NEW 
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Later in the Kalevala (1943, 44) we have 

Sip silking hivuata! 'tie up your hair with silk ribbons' 

where we have the archaic hivus as i n (14). 

Actually even whole phrases are used in the same stylistic contexts as 
the other form: 

Vaka yjBhfl Valnamoinen 'steadfast old Vainuoinen' 

This Phrase i8 repeated over and over again in the Kalevala. A thorough 
search would probably give citations of *>st of the olc^r form, if not 
in the Kalevala, then in some other archaic work which is still available 
and read today. 

. D • r ?iSS er V ^Kalevalaic words in the above mentioned chapter 
by Hakulinen (1978: ch. 2) which I use but for which the stereotypic 
be discussed below) is somewhat fuzzy, l.t seem that while using these 
one is trying to evoke a foreigner image or an image of aomeone not quite 
" Mte ™f W»n«h- Ihese are, I believe, also borrowings from some 
? B,f ^C i written works. For font (20) below, for example, Hakulinen 
(p. 38) gives as one source a document from the 16th century (Agricola) 
•fT? e4l 1 0f , which are **** duril1 ' M«*ory ceases. These form are used 
stylistically like the other archaic form and in that sense they have 
the same status as the Kalevalaic form and therefore they are alao 
problematic in the same way, as we will see below. Some 0 f these other 
form are the following: 

OLD NEW 

(19) vodottaa - >dottaa 'to wait' 

(20) soo ~ ayo 'he eats' 

(21) too - tyo 'work' 

(22) nyyt - nyt » n0 w» 

Well-defined St ylistic Function and Value 

Synchronically these older form have an archaic and humorous value 
while the «odern forms are unmrked. Thus they would aeem to exemplify 
Kury*owicz'a f ourt h "law" of analogy: when two Morphemes are 
undergoing differentiation the new form corresponds to the basic function 
while the older one has a derived, secondary function (1945). For 
example, the extension of the archaic [hevoinen] 'horse' is the same as 
that of the new [hevonen] but the intension in somewhat funny. The same 
holds for all of the pairs and thus the older elements are relegated to 
some special functions alone. The phenomenon is comparable to the one 
Se./ie refers to in his question N Knowest thou him who cslleth himself 
Richard Nixon?" This question gets a different response than the 
following: "Do you know Richard Kixon? H (1975: 76). The hearer 
understands the intended meaning and responds accordingly, i.e.. 
differently. 

All of these archaic item belong to specific stylea: mainly (in 
Joos's term) to the intimate and casual styles. The use of these styles 
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involves intiaate friend* and/or aeabers of a group equal in teras of age 
or social status, and wore or less inforaal situations. Thus these 
archaism occur only in liaited, non linguistically definable contexts. 
But they are nevertheless widely shared since, for example, they can be 
heard on TV and radio shows. The function of their use is that of humor, 
e.g., to announce "let's keep the situation relaxed and inforaal", but 
their specific Meaning or connotation has to do with something we Might 
call Kalevalaic (excluding the "fuzzy" set (19) - (22) froa Agricola's 
times) ; the stereotype being evoked has a Kalevalaic character. I do not 
wish to claia that each single itea above has exactly the saae stereotype 
for every user but that a significant subset of thea is shared and this 
subset hss a Kalevalaic flavor. 

Psychologically there is a difference in the stereotypes between the 
archaic foras and those of aoae synchronic dialect foras. For exaaple, 
words like (23) -(28) are at present unaarked foras in dialects other 
thar *ne. 

t »ie, aaa = aina 'I* 

' } koloae = kolas 'three' 

(24) pelakka = palkka 'salary 1 

(25) ayara = syoda 'to eat' 

(26) tehra = tehda 'to do' 

(27) hyo = he 'they' 

(28) has = hao 'ahe/he' 

The use of these evokes different stereotypes than the archaic ones. In 
this case the character io clearly synchronic and actually even 
geographically circuascribed. Thus in both cases the foras are used for 
stereotyping hut the iaages being evoked are different. And this would 
fit neatly with the ideas of Dowty's aentioned above: the initial context 
of learning ia reaeabered and consequently it has becoae part of the 
derivation. 

Scott (1962) has discussed what she calls 'vivid language*, and its 
role in language change. Vivid language, according to her, is 
characterized by intentional but interpretable deviance, and novelty. By 
this definition the archaic foras clearly count as vivid language: they 
are intentionally deviant and also interpretable, and they have novelty 
value, at least in particular contexts. Scott gives exaaples of how 
vivid language Bay becoae conventionalized and result in changes in a 
language. Thus vivid language ia relevant to historical studies and 
therefore vivid language, like different styles, aust be part of the 
corpus in diachronic explorations. I will return to this point below. 

Unlikely Future Kyriolexia 

If the archaic foras under consideration indeed have becoae 
conventionalized, and since they exist alongside the new foras, is it to 
be expected that thay becoae one day the kyriolexia (Householder 1963) or 
the modern unaarked foras? Householder dicusaes how a speaker has a 
subconscious nor* (or as I would call it, a pragaatic testate) against 
which lexical items are Batched when they are heard or used. One itea 
typically ia the fc «ra or kyriolexia (16. hevonen.) while the other (16. 
heyoinen) ia the tolerated deviation ,(cf. Kury*owicz and Scott above), 
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and when the tolerated deviation is "promoted" i t "overthrows" toe 
previous nons and becomes the new kyriolexia. 

Since these archaic forms are stylistic variants, i.e., vivid 
language, and their use is nonlinguistically governed, and tbey are not 
tbe kyriolexia, is it to be expected that they will become tbe new 
noma? It would not be impossible, but in the present case this would 
sees very unlikely because of their well-defined stylistic function, 
•trong archaic flavor and huaorous value. Ihey are not in competition 
with the new forms but have a clearly assigned place in the language 
use. But on the other hand, they are not likely to disappear either, 
partly due to their above-mentioned function, but also because their 
proposed sources are continuously available and tbey enjoy some kind of 
prestige. 

In-group Prestige 

It is commonly assumed that the aource of borrowing possesses 
prestige, but as Labov has made it clear, prestige is not a 
unidimensional concept. He stresses that language change in general and 
thus also borrowing cannot be understood apart from the aocial aetting 
wnere it takes place (1978:23). Social meanings thus get attached to 
certain alternative forms, snd, to <*uote Sturtevant, theae "rivala ahall 
acquire some sort of prestige" (ibid.: 3). But the crucial point is thst 
the prestige lies in tbe eye of the beholder - if someone is perceived as 
possessing some desirable feature(s) he may well be imitated regardless 
of bis actual social standing. 

How would prestige explain the present case of borrowing? As stated 
above the K levalaic works are highly valued by tbe Finnish people in 
general; t over, there is some degree 0 f identification with the 
Kalevala ae historical sense, for the Finns sometimes address 
themselves as the Kalevalaic people: 

"Mehan oll aan tammosts Kalevan/kalevalaista kansaa t " 
we are thia Kaleva's/Kalevalaic people 
"He are, after all, this aort of Kalevalaic people" 

The identification is in the sense of continuity, in the sense of sharing 
something that forms one of the roots of "Finniaboess". Even if I speak 
here of introspection, the forms are shared, as was pointed out sbove, so 
that they can even be heard on TV and radio ahows. Maybe tbere is 
something similar going on here as was on Martha's Vineyard (Labov: 
1978): these archaic forms with their Kalevalaic stereotypes are shared 
linguistic conventions, and they are used to convey some kind of humorous 
in-group sentiment. Their use is stylistically governed but they are 
part of the stable linguistic repertoire; the forms are used only with 

insiders". But Finlsnd ia small snd culturslly relatively homogeneous, 
and all the Finns are apparently seen to be insiders. 

Relevant Data from Styles 

Hammerstrcm suggests thst "in synchrony, neither "older" nor 
younger M forms... nor chsnge can be conaidered, as they are not part of 
that knowledge which is used in cosmunication" (1982: 51). For hi» the 

o 
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use of older forms of a language, or of foreign word* in one's speech, is 
■ore like a case of Mention rather than of use, and thus mist be 
eliminated fro* the synchronic description of a language. As has been 
shown above the archaic fonss are used in "true" communication, in true 
social discourse; they are not instances of Mention at all but carry 
their own intended, situationally relevant Meanings. 

Buccellati notes that too often anything "formally bizarre" or 
unexplainable is assigned to stylistics and thus discarded from the data 
(1981). For Buccellati, style is not a synchronic phenomenon either, but 
exclusively a diachronic "distributional category": "a recurrent 
selection of relatively idiosyncratic features" (1981: 808). In 
linguistics -these features can be morphological, syntactic or lexical 
items "sufficiently distinctive to acquire stylistic value" (ibid.: 
809)* Idiosyncracy in this definition refers to the distinctiveness of 
the items within a given "assemblage" of language as opposed to another 
"assemblage". Here styles can be seen as particular assemblages and the 
archaic items have distinctive value in some styles since they are not 
found in all styles. 

I would agree with Buccellati, and Scott above, and consider 
stylistic choices as 'vivid lsngusge' relevant for diachronic analysis, 
but contrary to both Buccellati and Bammarstram, also for synchronic 
analysis. After all, many parts of language in general could be 
allocated to certain styles slone, but we would not discard those parts. 
The use of slang snd Jargon, for example, is also stylistically 
governed. If archaic items are to be eliminated from linguistic 
analysis, so too, then, are jargon and slang. 

Problems for Diachronic Studies in Apparent Time 

Now that the srchaic forms are seen as part of the relevsnt corpua, 
their retention in the data may pose problems for a study of language 
change in apparent time. In this kind of study we need real time 
evidence to show that the part of the data being studied has been the 
unmarked data at some earlier time (Labov 1978: 275). We do have this 
evidence for all of the forms: they have been the neutral forms at some 
earlier point in time. But if a linguist now in his apparent-time study 
of Finnish finds these archaic forms alongside the unmarked forms, he 
would not have any motivation for saying that these forms are not in 
competition with the unmarked forms. Consequently he would be forced to 
conclude that he is facing the typical vacillation situation of language 
change and that Finniah ia undergoing change. Additionally, he might not 
find these elements in the speech of the very young population which has 
not yet been exposed to these writings and this fact would only support 
his conclusion. But this conclusion conflicts with the facts: most of 
these archaic items are synchronically living, stylistic variants of the 
unmarked synchronic forms. They exemplify the fact that a native spesker 
of a language knows a considerable range of facts about thia language. 
The sources m*y be old texts, ordinary school books, mass media, home and 
friends, etc. We cannot forget in historical or synchronic studies that 
our language is a multidimensional entity in terms of styles and 
registers and apeakera have mastery of this entity. 
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Sums ary 



We have examined the use of soae archaic lexical itews of Finnish 
everyday discourse. These items occur only in specific, 
nonlinftuistically describable contexts. They involve a Ktievalaic, 
archaic stereotype and they probably are instances of dialect borrowing, 
partly from literary sources, possibly also from conservative dialects. 
Since they are part of the stable linguistic system, they are valid data 
for both synchronic and diachronic analyses. Their coexistence, however, 
gives the wrong impression that the speakers are alternating between two 
competing forms. But this is only apparent vacillation; actually the 
phenomenon is something resembling a "Kalevalaic diglossia". Thus not 
all variation between archaic and unmarked forms is vacillation 
signalling change in progress. 



* This paper was read at the annual meeting of the Finno-Ugric Studies 
Association of Canada in Montreal in May 1985; the theme of the meeting 
was the 150th anniversary of the first publication of the Kalevala. 
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Diglossia in Ancient India 
Qina M. Lee 

1. Introduction 

The rich variety of languages spoken in Modern India, with representatives 
of •everal lanfliage families (Indo- Aryan, Dravidian, and Munda, as well aa 

^il in^V,?^^ "V^il^T* ? «-"*<*"ons of language contact in 
India, and South Asia in general. In particular, the relationship of some 
Indian lanfnafes spoken within the same speech communities has been aaid to be 
diflosaic: Oair (1968) and De Silva (1974) have proposed that the 
relationahip between the literary and colloquial varieties of modern Sinhalese 
(apokan in Ceylon) is diglosaic. 

Like its present day counterpart, ancient India waa a multilingual area. 
Hot only were the ancestors of Modern Indo- Aryan languages (namely Sanskrit 
*nd the Prakrita) spoken in the same region, but also the forerunners of 
■odera Tamil and Munda. Diachronically speaking, Sanskrit (both Vedic and 
Classical) is considered Old Indo- Aryan, and the Prakrits are traditionally 
considered Middle Indo-Aryan. But many (e.f. Emeneau 1966) have noted that 
Sanskrit and Prakrit were also spoken during the saae time period. 

Although Indo-Aryan scholars have continually referred to the Prakrits 
as the popular dialects and to Sanakrit as the language of the learned, the 
possibility of diglossia existing in ancient India was not discussed in depth 
until Hock and Pendharipande (1976).! Even ao, later scholara have not 
expanded on the hypothesis of diglossia during ancient times; Deshpande (1979) 
discusses instances of conflicting aociolinguiatic attitudes in ancient India, 
but doea not provide direct evidence for or against digloasia. 

The purpose of this paper is to evaluate the evidence (preaented primarily 
by Hock and Pandharipande) in favor of a digloasic relationship involving 
Sanskrit and the Prakrits in tenss of Ferguson's original 1959 definition . 2 
In aaking their claim that Sanskrit and Prakrit were used in digloasic 
situations as early aa the time of the Rig Veda, Hock and Pandharipande give 
three types of evidence. They present aa the best-known evidence the language 
differentiation in the Sanskrit drama, in which Sanakrit was used by 
characters representing the higher castes and various Prakrits were used by 
characters representing the lower social castes. 

Also cited aa evidence are various passages from the primary Sanakrit 
literature, moat notably from the writinga of the grammarian Patafijali (c. 150 

i^/ ata5j note * in r ? ferri »* to Panini (1.1.1., 259:13) that there 
are differences between the sistabhaaa . the language of the learned, and 
the lokabhasa, the language of the common people. The Natyaaastra, 
the oldest treatise on Sanskrit drama (attributed to Bharata, c . third century 
A.0. ), gives factors which determine whether s character may or may not use 
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Sanskrit. The important factors were aocial status, caste, occupation, and 
social context. The level of education was an important factor, for 
well-educated people were to use Sanskrit. 

Hock and Pandharipande Mention briefly, as a third type of evidence, the 
occurrence of so-called hyper-San&kri r isms, hyper correct ions of Sanskrit 
(or, in most cases, Prakrit) form which are intended to avoid patterns 
found in Prakrit. 

The evidence supplied by the ancient Indian coHsentators and grammarians 
provides strong evidence for, if not diglossis, at least some level of 
conflict between the two language varieties. Such evidence will not be 
disputed here. What is open to question, though, is the evidence from the 
Sanskrit drama of the period 100-1000 A.D. It appears that the dram say not 
be a reflection of the actual structure of ancient Indian society. On the 
other hand, though, the evidence provided by hyper-Sanakritisme can be shown 
to be sore important to the argument for diglossia than what Hock and 
Pandharipande claim. Thia paper contains the results of a systematic 
investigation of hyper-Sanskrit isms. 



2. The Evidence from the Dram 

The Sanskrit drama provides evidence for, at the very least, the literary 
coexistence of Sanskrit and Prakrit. In general, Sanskrit was used by 
characters of the higher social castes; within the same play, various types of 
Prakrits were used by characters of lower aocial groups, which included 
comic characters and women. According to Rajasekhsra (c. 900 A.D.), a 
dramatist who had a special interest in language, Prakrit i* "smooth" 
(hence, its general use by women) while Sanskrit is "harsh** (hence, its 
general use by men.) Although the Natyasastra gave elaborate rules for 
the use of language in the drama, such rules were by no means rigid. A 
considerable amount of variability existed, particularly in the use of 
Prakrit. 

Authority figures such as kings and genera la were to use Sanskrit; snd as 
might be expected, Brahmins were also to use Sanskrit. Some female characters 
used Sanskrit: the chief queen, the ministers' daughters, and occasionally 
Buddhist nuns, female entertainers, women artists, and allegorical female 
characters. Without fail the descriptions of battles, peace negotiations, and 
omens required the use of Sanskrit. 

On the other hand, the Prakrits were used by women other than those 
mentioned above, as well as by men of lower rank. Particular dialects were 
ascribed to particular types of people, although the use of a particular 
dialect differed from author to author. S auras en I was generally used by 
women of "good family", their servant a, and Middle class males. MagadhI, 
another well-known Prakrit, was used by men living within the women's 
spartments, diggers of underground passages, bartenders, and, interestingly, 
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by the hero in times of danger (possibly expressing bis "feminine", emotional 
side). Gamblers used Avanti and Dakainatya. Sometimes, two varieties 
of Prakrit were used within the sane' play: Kali das a (c. 400 A. D.) used 
S auras en £ in prose, Haharastri in verses. 

However, according to one of the earliest scholars on Sanskrit dram, 
Sylvain Levi ( 16 theatre indien , 1890), the dram could not have reflected a 
digloasic situation. The plays, in his view, were originally composed in 
Prskrit. As a result of the rise of Sanskrit as the language of literature 
as well as religion, the drama developed a Mixture of the two varieties. 
Moreover, Levi argued that "India ... was never anxious for contact with 
reality, and it is absurd to suppose that the aixture of languages was adopted 
as a representation of the actual speech usage of the ti»KJ . . . " (quoted in 
Keith 1924: 46). 

But the evidence so far is that the drama was not secular in origin, but 
religious, arising fro* epic recitations. Moreover, in the work of the 
earliest known Sanskrit dramatist, Asvaghoss (c. first century A.D.), 
Prakrit appeared mainly in the dialogue, wnile Sanskrit appeared mainly in 
the verses. Thus, it appears that in the early dramas, Prikrit was 
introduced into what was essentially a Sanskrit drama, in order to reflect the 
status of the inferior characters. 

Other arguments can be made that the language usage in the drama cannot be 
due simply to an imitation of the real life situation. The Prakrits of the 
later dramas were in some respect different from the Prakrits spoken in 
everyday situations. As early as 400 A.D. , the Prakrits used in the drama 
began to take on artificial, literary forms. Reference is made to 
vibhssss, stereotyped variants of the "more normal" Prakrits, which 
refer *to some literary Prakrits. For example, people of menial occupations 
used certain Prakrits: herdsmen used Sabar$ or Abhlr£; charcoal 
burners, hunters, and carpentera also used Sabari. (But the existence of 
literary forms does not necessarily mean that the Prakrits used in the drama 
are completely unreliable as evidence; in a study of Irish literary dialects 
Sullivan (I960) argues that literary dialects can reflect characteristics of 
the actual speech.) 

Moreover, there is evidence that the drama appealed to only a limited 
Indian audience and was intended to be viewed only by members of the higher 
social classes. As early as 900 A.O., chayas , translations of the 
Prakrit portions into Sanskrit, were common. No ovidence exists for 
translations of the Sanskrit portions into Prakrit, which suggests that the 
dramas were written mainly to be viewed by those who knew Sanskrit, i.e. the 
learned. Keith (1924: 242, 369-371) argued that the Sanskrit playwright's 
works were aimed mainly at the learned. Using (in part) information from 
unpublished texts, Balbir stated that ". . . the Sanskrit drama perhaps was 
never a light amusement of everyday life ... it is obvious that the Sanskrit 
drama was intended to be a drama of the elite, enjoyed by qualified persons . 
. . a refined product religiously presented as an offering before a 
discriminating audience . . ." (1962: 44) 
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The drama could only be appreciated by a special group of people Who were 
not only trained to appreciate the aesthetic qualities of the dram, but Who 
could also be empethetic with the characters on stage. Appreciation for the 
drama could only be cultivated by a certain aaount of study. The ideal 
spectator had to be knowledgable about many things, among thea the H rules of 
dialects . . . (and also) grassier" (Balbir, quoting from the 
Natyas'astrn). Citing Levi, Balbir states that M all the spectators are not 
apt to relish the rasa [* taste, feeling']; it is a sort of prize one has to 
deserve after an assiduous study of poems and healthy and delicate i sprees ion 
accumulated fro* the previous births. " These people are referred to by 
various terns in the primary literature: as preksaka , aamajika . sabhya, 
and sabhaaada . 

Sabhasada refers to 'au assistant at a aeeting or assessor in a court of 
justice.' Preksaka Beans 1 looking at, viewing rr intending to view', as well 
as 'spectator, aeaber of an audience'; but it could also have the meaning of 
1 considering* or 1 judging'. Saaajika is a tera that was neutral in 
meaning, meaning 'spectator, aeaber of or assistant at an asseably'. Sabhya 
could be neutral in aeaning as well, meaning 'being in an asseably hall or 
aeeting room, belonging to or fit for an asseably or court'; it could also, 
however, mean 'suitable to good society, courteous, polite, refined, 
civilized, not vulgar, decorous' (as speech); or *a person of honorable 
parentage'. Such spectators were, for the most part, members of the higher 
social classes. 

It was essential that audience members be well-qualified to view the 
Sanskrit draaa, for the audience members decided whether the play was a hit or 
not. Every ancient Indian audience had a sabhapati (literally 
'audience-ruler'), the guest of honor, who made the final decision as to the 
success of the piay. Hie sabhapati had advisors to guide hia in his 
decision; each advisor was a specialist on a particular aspect of draaa. Also 
present at the Sanskrit draaa were "assessors" , people of various occupations 
whose job was to evaluate the acting of individual performers. What is of 
interest here is that grammarians were also present as assessors. 

The coaaon folk also attended dramas; their opinions on the success of the 
play were acknowledged, hut were not respected. According to the 
Nstyas'astra, the audience was divided into two types: divine and human. 
The divine refers to the "cultured audience who generally take interest in 
deeper and aore subtle aspects of a dramatic and as such are above ordinary 
human beings" (Balbir quoting translation froa Ghosh, p. 513, fn. 17 It 15). 
The human element refera to the coaaon people who were appreciative only of 
superficial aspects of the draaa, and not of the deeper aspects. 

Certainly the Sanskrit draaa was something that was staged only on special 
occasions, such as ailitary victories, festivals honoring the gods, or 
weddings. The playhouses (the natyavesma . natyagrha, and preksagrha ) 
are described in the literature as having elaborate seating arrangements, with 
the best seat in the house given to the sabhapati . In some instances, they 
are referred to as "palace-theatres", which aay indicate that some plays were 
staged within makeshift theatres within the royal palaces. 
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Given that the Sanskrit dram was viewed by a limited audience, the 
bilingual nature of the Sanskrit drama does not provide conclusive evidence 
for diglossia. Stronger evidence for the high social status assigned to 
Sanskrit coses from hyper-Sanakritisms. 



3. The Evidence from ffyper-Samtlrrltj mm* 

Linguists have devoted a fair amount of attention to hypercorrections, the 
use of a form based on attempts to avoid forms found in low prestige 
dialects. DeCamp (1972) Mentions various examples of phonological 
hypercorrection in American English, such as /r/ inaertion in so»e San 
Francisco dialects, or Jamaican Creole substitution of /e/ for /t/ in words 
such as /flier/. In such forms, there is an effort, conscious or not, to 
avoid using forms which are phonologically similar to low prestige forms, even 
if they are not low prestige pronunciations, /r/ insertion appears to have 
originated from an attempt to avoid using what could appear as /r/ deletion; 
the /e/ for /t/ substitution resulted from an awareness of the converse 
substitution in low prestige dialects. In addition, labov (1972) describes 
hypercorrections in terms of the frequency of usage of correct forms; the 
middle class is likely to use prescriptively correct forms more often than 
higher social classes. 

In this discussion, I am using the term hypercorrection in a more 
general sense than what has been traditionally used: to refer to any 
morphological change which originates as an attempt to avoid using forms which 
contain phonological patterns found in a low prestige dialect. Since the 
original forms do not violate phonotactic (or syntsctic) rules, such 
"corrections" are unnecessary from a structural viewpoint; hence, they are 
Aflper-corrections. Traditionally, hypercorrections have been used to 
refer to prescriptively/etymologically incorrect forms which originate in such 
■anner, but prescriptive or etymological correctness/incorrectness is 
unimportant. What is important i 3 the aocial forces behind such 
modifications. 

Perhaps the best examples of such forms found in a language not usually 
considered a living language are found in the hyper-Sanskrit isms, phonological 
hypercorrections (limited to certain lexical items) which originated as 
modifications of Prakrit forms, or of Sanskrit forms which contain patterns 
found in Prakrit. Some, if not all, Sanskrit speakers must have been aware 
of the phonological differences between Ssnskrit and the Prakrits. In a few 
instances, Prakrit words which are borrowed into Sanskrit are modified to 
sound more Sanskritic. For example, Sanskrit has a noun utkuruta- 'dustheap', 
which originates as a hypercorrection from the Prakrit form having the same 
weaning, ukkurudi-. The Prakrit reflex of Ssnskrit tk is kk. From a 
phonological standpoint, there is no motivstion to change the kk sequence to 
tk because kk can occur in Sanskrit, as in Skt. kskkola- 'a species of 
plant*. The only motivation for such a change, if not due to loan phonology, 
is a social one: Sanskrit speakers wsnted to avoid using the kk sequence 
which, in principle, could be perceived as a Prakrit sequence I s 
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In soae cases, words which were Sanskrit in origin were erroneously 
perceived as Prakrit and Modified so as to be "more Sanskrit**. Utsuka- is a 
Modification of Sanskrit *ucchuks-, which comcs froM Old Indie *icchuks-. 
Since the cch sequence in S ucchuks - is identical to the cch sequence which is 
the Prakrit reflex of Sanskrit ts, the Sanskrit form changed in a direction 
away froa (what was perceived as) Prakrit. 

I examined all cases of hyper-Sanskrit Isms ( primarily) from two sources. 
One of the earliest works which refers to hyper-Sanskrit isms (and uses the 
term hyper-Sanakritism) is Blooafield and Bdgerton'a work on Vedic 
phonetics (1932: 20). The influence of Prakrit on Sanskrit is manifested in 
two ways: first, by Prakritisms, changes in Sanskrit formu in the direction 
of Prakrit. Many writings in Vedic Sanskrit (including the Rig Veda) 
contained unusual Sanskrit forms which are phonetic variants that follow sound 
patterns in Prakrit. For example, the form tyastr- 'creator' has a variant 
form tyastri-, which appears to be influenced by" the occurrence in some 
Prakrits 'of ri (or ru) for Sanskrit r. Secondly, the opposite may happen: 
the Sanskrit form may have a variant* form which is modified in a direction 
away from Prakrit-like forms, or toward a variety of Sanskrit which cannot 
be perceived as having any Prakrit influences, as in the 
hyper-Sanskrit isms. The hyper-Sanskrit isas cited in Blooafield and Bdgerton 
appear to be hypercorrected forms of Sanskrit forms erroneously perceived as 
Prakrit. It is these types of hyper-Ssnskritisas which Hock and 
Pandharipande cite as evidence for diglossia. 

Mayrhofer (1956) takes s different approach to hyper-Sanskrit isms. He 
defines a ifypcraanskritiaaua in the following way: 

Perhaps still more frequently than the undertaking of the pure or 
almost unchanged dialectal forms was also the case that these have 
been again adapted falsely to the high dialect. . . In several cases 
... we encounter strange Rihk-Sanskri ti aienmgeo of such Middle 
Indie (or, even only to be regarded as M[idd)e] Ifndic], in truth 
correct Old Indie) words and these ftickbi ldungen sre again a 
fact, which the Old Indie etymology by all means has included, 
(my translation of Mayrhofer 1956: 9) 

In volume I of Mayrhofer 'a work, I examined each entry to aee whether it 
could be attributable to a hyper-Sanskrit izat ion. 4 (Unfortunately) Mayrhofer 
uses five terms to refer to such hypercorrectiona: Uyperaanskri tiamua, 
Rtick-Saaakritisiervog, Riickbildwg, falache Sanakritisiertmg, and 
(occasionally) Sanskrit i&ienmg. These are distinguished froa 
Prakritisms ( [eio] Prakritiawua or dialektiacbe Formen) . 
Mayrhofer is aainly concerned with Modifications in Prakrit forms which 
eliminate certain patterns found in Prakrit. However, such modificstions 
are, from a social standpoint, the aame type of modifications that occur in 
Blooafield and Bdgerton's hyper-Sanskrit isms. 
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In examining hyper-Sanskrit isms, I found that they are not limited to only 
one or two categories, but that there were apparently Many types of 
hyper-Sanskrit laws that took place. This has two important implications. 
First, the occurrence of such types suggests that hypercorrection my play a 
greater role in morphological change than previously thought. Many have 
acknowledged that language change can arise as a result of speakers' tendency 
to regularize, as in analogical change. Occasionally morphological changes 
occur which involve apparent reversals of established sound correspondences. 
The best explanation for such reversals, especially in situations involving 
literary and colloquial variants, is hypercorrection. Thus, social factors 
can play an important role in accounting for changes in the phonological shape 



On the basis of the similarity in the types of forms found, as well as the 
variety of types, it appeara that hyper-Ssnskritisms are not a "grab-bag" 
group of words whose phonetic ahape cannot be explained, but rather are words 
which reflect an actual aociolinguistic phenomenon in ancient India. It could 
not aimply be a coincidence that all of the patterns found involved a change 
from (apparent) "Prakrit" to "Sanskrit"; the only possible motivation for 
such changes is hypercorrection. Though there is no semantic pattern in these 
forms, the hyper-Sanskrit isms fall into a set of distinct groups, as 
follows: 1 * [note: unless otherwise indicated, the original forms are Middle 
Indie; forms which are indicated as variants come from original Vedic forms: 
MI = Middle Indie, 01 = Old Indie] 



Modifications of Consonant Sequences: 

a. One of the Prakrit reflexes of Sanskrit ts is c(h) (frequently doubled 
to cch), as in Skt. matsara-, Pkt. macchara- 'cheerful; intoxicating', cch is 
a possible (and common) word-internal sequence in Sanskrit, as in gaccha- 
*tree\ A number of hyper-Sanskrit isms were found involving ts. for c(h)/cch: 



guts a- from guccha- 'bundle' 

utsuka- from *ucchuka-, 01 icchu-, 'restless, anxious, longing for' 
utsadana - from ucchadana - 'rubbing' 

kudyamatsr - Zkudymatsya- from *kudemac(h)- 'house lizard' 
Jugupsa- 'avoids, detests' fros MI »Juguechn - (Pali ji guccha -). 
'abhorrence'; desiderative of guy- 'protect'. 



b. Prakrit occasionally has j[k)kh for Sanskrit ks, as in Skt. bhiksu-, pkt. 
bhikkhu- 'monk'. In Apabhramaa, such a change occurs regularly, as'in Skt. 
ksptriya-, Apa(t . khattiu - 'warrior'. (k)kh was possible in Sanskrit, as in 
kfiakkhati '(s)he laughs'. Nonetheless, Sanskrit speakers substituted ks for 
(k)kh in some words: 



ERIC 




- 158 - 



aksauhinl - froa MI » akkhohini -> Pali akkhobhani - 

* 'complete aray' 
ksitra- froa khatra- 'breach, tunnel' 
ksiv - froa khiv- 'spits' 
ruksa - froa MI rukkha-, 01 vrksa- 'tree' 
ksvel - froa khel- 'leap, jump, *play' 



c. The Prakrit n.-tlex of Sanskrit tk is kk, as in Skt. utkara- . Pkt. 
ukkero, 'heap.' kk is a possible Sanskrit sequence, as in kakkola- 'a species 
of plant'. I found one example of a semantical ly related hypercorrected 
fom: utkuruta - for ukkn-urll- 'dust heap' ; also, aukta- froa tmutta -. 

Pali, Pkt. mutta-, 01 murta- 'pearl'. 

d. Prakrit Jd can also arise froa Sanskrit rk, as in Pkt. akka -, Sanskrit 
arka- 'ray, flash of lightning; sun.' In one hyper-Sanskritisa, rk is 
subtituted for kk: kurkuta- froa older, literary kukkuta - 'cock.' 

e. In some hyper-Sanskritisas, tt becaae st(h ): kandostha - froa Pkt. 
kaadotta -, kamdvtta -, 01 kan data- *-* 'blue lotus'; adhyusta - froa MI addhutta-, 
01 ardhacaturtha -* 'three and one-half. Nuaerous examples of tt occur in* 
Sanskrit: atta- 'watch-towe**; market.' s'ch did not regularly T u"ecoae tt in 
Prakrit, but 'compare st> tth in foras such as Skt. drsti -, Pkt. ditthp 



f. Prakrit shows bbh for Sanskrit dbh . as in Skt. sarfbhava-, Pkt. 
aabbhava - 'good nature.' One type of hyper-Sanakritiua involved dbh for 
bbh : adbhis/adbhyas (instr/dat, abl pi. of ag-) froa ♦ abbhis . abbhyas 
'water'. A compound for* abbhaksa- 'living upon water' shows that bbh can 
occur in Sanskrit. 

g. Sanskrit rv becaae yy in Prakrit, as in Skt. sarva -, Pkt. savya-, 
'all.' yy became ry in hyper-Sanskrit isms: 7 



h. In one hyper-Sanskrit ism, vg comes froa gg, as in argala - froa MI aggala-, 
01 t atfra -Ia- 'going beyond'. is a possible sequence in Sanskrit, as in the 
coapound 'igga.ia - 'one of the elephants in the four quarters (who support the 
earth) ' . 



Modifications of Individual Canrvjauts: 

i. Dialectally in Prakrit, d was substituted for t in a liaited set of 
words, all of which are forms of the second person singular pronoun, e.g. 
dava for tavat 'your. ' In some words, Prakrit shows d for Sanskrit t, 
as in Skt. parita -. Pkt. uarida - 'around.' A few Prakrit isms involved the 
interchanging of voiced stops for voiceless stops, as in the case of edagva - 



'sight'. 



urvarita - from uvvaria -, M eft, lef*. over'. 

car;— for 01 *cayy-, 'grinds with th^ teeth, chews' 
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for etagva- 'of variegated color, shining (horses)', and piga - for pika - 
•cuckoo'. Hypercorrections in the other direction occur as well: deyi- 
* (perhaps) nursery tent for Mother' has the variant font tevi- . 

j. Some Prakritisms show v for p., as in Janovada- fro* 01 janapavada - 
•gossip, ill report'. The converse hyper-Sanskritisa occurs: kapata - from 
kayita- 'leaf of a door.' 

k. Prakrit regularly shows h where Sanskrit has aspirated stops, as in the 
following: Skt. sukha -, Pkt. suha - 'pleasure'; Skt. laghuka -. Pkt. lahua- 
'amall o^e'; Skt. pathika-, Pkt. pallia- 1 traveller '; Skt. nidhi -. Pkt. nihi- 
'treaauie'; and Skt. abhinava -, Pkt. ahinava - 'fresh'. Some Prakritic fotas 
showed a substitution of h for aspirated stops, as in kakuha- from kakubha- 
Miigh, eminent, great'; gahana - from gambha - 'deep'. The corresponding 
hyper-Sanskrit ism of dh from h occurs: 

gudhera - 'rom guhera- 'protecting' 

avadhamaa - from Pkt. ohaaao-, 01 * avagharsa - 'red sandal* 

1* In soma Vedic foras, j was substituted for original d 'especially before 
y.): dyut- has the variant jyut - 'shine'; original daha , imperative of han- 
'strike, kill', became jahi . The opposite hypercorrection occurs as well: 
jya- 'bow string', has the variant dya-. 



Vocalic Hyper-Sanskrit isms: 

a. Prakrit frequently reduced word final -as (-ah) to -o, as in Skt. 
druses, Pkt. Hump, 'tree.' Final -o occurred in Sanskrit as a result of a 
sandhi rule involving the change of final -as to -o before voiced consonants, 
as in devo gacchati (from underlying devas gacchati ). As might be 
expected, as is substituted for o in hypercorrections: 

amas for 01 tamo 'this', nom. sg. 
adas for 01 » ado 'that', uoa. sg. 

n. Prakrit occasionally . i (and sometimes a) in place of vocalic r, as 
in Skt. drdha-, Pkt. dadha - 'firm'; Skt. aartg-,~Pkt. amia - 'nectar' an5 Skt. 
prakrta -,*Pkt. paua- 'Prakrit'. Numerous Prakritisms. show a 
substitution of i for vocalic r as ir. pinnate fro« 01 *^rbhnati, third 
singular present of grabh- ( grlh -), 'takes'. Some hyper-Sanskrit isms have r 
interchanged with i or u: r 

krcchra - from *kicchr§- 'evil, bad' 

masrna - froa MI masina -, 01 mrtsna - 'soft, mild' 

( gotra)bhrd , a variant of gotrabhid 'opening the cow-pens of the 

sky' (of Indra and Brhaspati's vehicle) 
laivatrka - froa 01 t jaivatu-ka - (vrddhi of Jivatu -) , 

' long-lived' 
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rcchwra- Courtesan 9 froa acchara- 'Apsaras' (name of female 
• divinity) 

Hhusrna - froa Pkt. ghuaina - 9 saffron 9 

o. Tha Prakrit dialects occasionally had foras with ru or ri for Vedic r 
(paralleling the Modern pronunciation)! as in hhruai- for bhrai- 
*whirlwind* . There is at least one hyper-Senakritisa corresponding to this: 
prtva - occurs as a variant of pruava- 'drop of water, riae, ice*. 



4. Conclusion 



The issue is whether Sanskrit and the Prakrits, which were used by 
speakers within the aaae speech coaaunities, could be accorded diglossic 
status. Certainly there ia evidence for the existence of a high and low 
variety, with Sanskrit holding the position of high prestige and Prakrit, 
low prestige, as Hock and Pandharipande argue. But the evdence froa the 
Sanskrit drama does not conclusively prove the existence oi diglossia, since 
the draaa was written aainly for audience aeabers who were vssentially the 
upper crust of society and, as a possible consequence, did not accurately 
portray actual language usage. It is also likely that the usa of Sanskrit and 
Prakrit in the draaa, especially in the later works, was aereiy a aatter of 
literary tradition, rather than a depiction of the real-life situation (Burrow 
1973: 60; cf. also the occasional stereotyped use of Souther* accents for 
inferior characters in Aaerican English). 

The hyper-Sanskrit iaas, however, are stronger evidence for diglosaia. 
Since neither the absolute number of hyper-Sanskritisas '*ound nor the absolute 
number of hypercorrection patterns provi;!- sr-rfusiv*: evidence for diglossia, 
ay intention is not to provide a statistical argument for diglossia. There iv 
no H aagic number" of hyper corrected f jras or patterns which conclusively 
indicates that speakers viewed each variety aa having different social 
status. Moreover, the number of hyper-Saoekritisas found in Mayrhofer'a 
dictionary does not provide a figure for the token frequency of worth 
which underwent such hypercorrection. Soae foras occur more frequently than 
others. In addition, the exiatence of hypercorrections in itself does not 
signal diglossia, since hypercorrections (of both phonological and 
aorpho-syn tactic nature) occur in non-diglossic situstions, such as Aaerican 
English. 

But if hypercorrection played only a ainor role in accounting for 
morphological change within a language, then one would not expect to find aeny 
different types cf hypercorrection. Certainly the occurrence of only one or 
two patterns could not be used as evidence for differing social attitudes 
toward the dialects. The large variety of hyper-Sanskrit isas, with numerous 
different patterns, strongly auggests that there were conscious efforts on the 
part of Sanskrit speakers to avoid using foras which sounded Prakrit ic. 

It appears that the Prakrits were not simply the dialects iu»ed by the 
popuJj t but were varieties that had low social standing. Sanskr'-t was, in 
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addition to being the language used by the learned, a variety that held much 
greater prestige than the Prakrits. Thus, in such the aaae way in which 
ancient Indian society was stratified, Sanskrit and the Prakrits were also 
socially differentiated. 



My thanks to Brian Joeph for his comments on earlier versions of this 
paper. 

1. De Silva (61-62) argues that, as early as 800 B.C., Vedic and 
Classical Sanskrit were used diglossically, with the Classical language as the 
high variety and Vedic as the low variety. 

2. Ferguson' a definition of diglossia, in its entirety, is as follows: 



... a relatively stable language situation in which, in 
addition to the priaary dialects of the language (which «ay 
include a standard or regional standards), there is a very 
divergent, highly codified (often grammatically aore coaplex) 
superposed variety, the vehicle of a large and respected body of 
literature, either of an earlier period or in another speech 
coaaunity, which is learned largely by formal education and is 
used for most written and formal spoken purposes but is not 
used by any sector of the coaaninity for ordinary conversation. 
(1969; 336) 



In this discussion, I am using a simplified version of Ferguson's 
"classic" definition: namely, situations involving a variety which is 
assigned high social status, while the other variety is regarded as having low 
status by speakers. This simplified version appears to be the sole criterion 
used by Hock and Pandharipande in their analysis (113); they do not discuss 
criteria other than prestige. The criterion of function, with mutually 
exclusive tasks assigned to each variety, is a natural consequence of the 
occurrence of high and low varieties. 

Also, this simplified version represents the essence of Ferguson's 
definition, which distinguishes diglossic situations from cases involving 
regional and stylistic variation. The two varieties must have a moderate 
amount of divergence, in the sense that they must be different enough so as 
not to he styles, but they must he similar enough so as not to he unrelated 
languages. Ferguson's definition differs significantly from Fishman's (1972) 
and Faaold's (1984) later modifications. Fishman agrees with Guaporz's 
argument (1961, 1962, 1964a, 1964b, 1966) that diglossia involves two 
functional ly differentiated language varieties of any type t regardless of 
their degree of divergency. According to Fishman, "diglossia is a 
characterization of the social allocation of functions to different languages 
or varieties" (1972: 102). Hence, the functional difference between the 
varieties is more crucial to Fishman (and Fasold, who agrees with Fishman) 
than their prestige. The only criterion which all have agreed on is function, 
with only alight overlapping of the social tasks assigned to each variety. 
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Hence, Fiahaen (by implication) and Fasold (explicitly) include regional 
and stylistic variation. But there are no real high or low varieties in such 
cases; speech styles do not carry the same social connotations that true 
"high" or "low" varieties do. Both Fiahaan and Fasold' s views trivialize the 
notion of diglossia, since sny stable situation in which two or wore varieties 
are- spoken within the save speech community would be diglosaic. 

My goal is not to argue for diglossia involving Sanskrit and Prakrit in 
tenaa of all characteristics stated by Ferguson; I leave that to present and 
future Sanskrit scholars. 

3. Burrow (1973: 61) points out that such modifications (which he tens 
[false] Sanakritization) abound in Buddhist Hybrid Sanskrit, which is 
essentially a Sanskritized Prakrit. Here, many Prakrit words are Modified 

to take on Sanskrit patterns, as opposed to Merely substituting the equivalent 
Sanskrit word. B.g.» Pkt. bhikkhussa . gen. sg. of bhikkhu 'monk* is changed to 
bhiksusya , thus "undoing* 1 the changes froa Sanskrit to Prakrit. (Skt. ay. 
became sn in Pkt. as in Skt. tasya . Pkt. taasa *his'; cf. also the Sanskrit 
equivalent font bhikaos . ) The discussion centers only on changes" in Vedic and 
Classical Sanskrit, although the evidence froa Buddhist Sanskrit does not 
detract froa the argument. 

4. Mayrhofer is less certain of the origin of scare forms than of others 
(vielleicht Hypemanskritismts) . With the exception of aoae fonas which 
Mayrhofer explicitly stated could not be hyper-Sanskrit i was, I considered any 
fona that could be a hyper correct ion to be sn actual hyper-Sanskrit iaa. 

5. Andronov (1977) invokes hypercorrection as au explanation for certain 
morphological changes in Dravidisn. (Only one of hie examples is an actual 
hypercorrection; the remaining appear to be due to folk etyaology or 
analogy.) The colloquial varietiea of Taail and Malaysia* ahow an alternation 
in roots between i/e and between u/o, with the high vowel a lowered to their 
aid coimterparta when the vowel in the following syllable is a. Literary 
Taail and Ma lay am, however, show no alternation; only i and u occur under this 
condition. Rarlier scholars have disregarded these facts because they would 
involve the following sequence of events: first, Proto-South-Dravidian 
contained high vowels which were lowered before a ayllable containing a. Then 
these aid vowels were raised in Proto-Tamil-Malayalam, followed by lowering in 
colloquial Taail and Malayaa, but not in the literary forms. However, there 
is no motivation for such a chronology. 

According to Andronov, hypercorrection ia the only logical explanation. 
Vowel lowering occurred occurred only once, in Proto South Dravidian. 
Bducated Taail and Malayaa speakers felt that such lowering was "incorrect" 
Taail. In the early stages of Taail, e and o could occur before syllables 
containing a which were not derived froa i or u, but which were originally mid 
vowels. Speakers of what came to be known as literary Taail (the high 
variety) retained the original high vowels before a, and raised the original 
aid vowels before a so as to not sound like speakers of the colloquial variety. 
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6. This list is not by sny Means a complete list of hyper-Sanskritisms. 

7. I have been unable to find any attested Sanskrit for* containing yv. 
This is somewhat unusual because y., which, like y, i a a semi-vowel, can occur 
as s geminate (aayyaaanabbogas. 'lying, sitting, and eating'). But there 
are situations in which yv could potentially occur. Whitney (section 228) 
Mentions that consonants (except for spirants preceding vowels) could 
optionally (and sometimes obligstorily) be geminated after r (snd, for soMe 
graMMarians, h, 1, or v). Citing Hock and Pandharipande (p7 116), Brian 
Joseph pointed out to Me that geMinatioo in taunts was prescript ively 
incorrect ( uutradini , not puttradinl 'cruel Mother 1 ), implying that 
Sanskrit speakers did geminate consonants in such forms. Also, two 
secondary sources (Coulson 1976: 24 snd Kale 1969: 10) give ligatures for yv. 
However, they cite no forms containing this sequence; perhaps these ligatures 
are hypothetical. 
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